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^Pgenes identified as required for prolifera^Win 

escherichia cou 
background of the invention 

Since the discovery of penicillin, the use of antibiotics to treat the ravages of bacterial infections has saved 
5 millions of lives. With the advent of these "miracle drugs," for a time it was popularly believed that humanity might, once 

and for all, be saved from the scourge of bacterial infections. In fact, during the 1980s and early 1990s, many large 
pharmaceutical companies cut back or eliminated antibiotics research and development. They believed that infectious 
disease caused by bacteria finally had been conquered and that markets for new drugs were limited. Unfortunately, 
this belief was overly optimistic. 

10 The tide is beginning to turn in favor of the bacteria as reports of drug resistant bacteria become more frequent. 

The United States Centers for Disease Control announced that one of the most powerful known antibiotics, vancomycin, 
was unable to treat an infection of the common Staphylococcus aureus (staph). This organism is commonly found in our 
environment and is responsible for many nosocomial infections. The import of this announcement becomes clear when 
one considers that vancomycin was used for years to treat infections caused by stubborn strains of bacteria, like 

15 staph, in short, the bacteria are becoming resistant to our most powerful antibiotics. If this trend continues, it is 

conceivable that we will return to a time when what are presently considered minor bacterial infections are fatal 
diseases. 

There are a number of causes for the predicament in which practitioners of medical arts find themselves. Over- 
prescription and improper prescription habits by some physicians have caused an indiscriminate increase in the availability 
20 of antibiotics to the public, the patient is also partly responsible, for even in instances where an antibiotic is the 

appropriate treatment, patients will often improperly use the drug, the result being yet another population of bacteria that 
is resistant, in whole or in part, to traditional antibiotics. 

The bacterial scourges that have haunted humanity remain, in spite of the development of modern scientific 
practices to deal with the diseases that they cause. Drug resistant bacteria are now advancing on the health of humanity. 
25 A new generation of antibiotics to once again deal with the pending health threat that bacteria present is required. 

Discovery of New Antibiotics 
As more and more bacterial strains become resistant to the panel of available antibiotics, new compounds 
are required. In the past, practitioners of pharmacology would have to rely upon traditional methods of drug discovery 
to generate novel, safe and efficacious compounds for the treatment of disease. Traditional drug discovery methods 
30 involve blindly testing potential drug candidate-molecules, often selected at random, in the hope that one might prove 

to be an effective treatment for some disease. The process is painstaking and laborious, with no guarantee of 
success. Today, the average cost to discover and develop a new drug is nearly US $500 million, and the average time 
is 15 years from laboratory to patient. Improving this process, even incrementally, would represent a huge advance in 
the generation of novel antimicrobial agents. 
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Newly. emergiriOractices in drug discovery utilize a number of biochemical techniques to provide for 
directed approaches to creating new drugs, rather than discovering them at random. For example, gene sequences and 
proteins encoded thereby that are required for the proliferation of an organism make for excellent targets since 
exposure of bacteria to compounds active against these targets would result in the inactivation of the organism. Once 
5 a target is identified, biochemical analysis of that target. can be used to discover or to design molecules that interact 

with and alter the functions of the target. Using physical and computational techniques, to analyze structural and 
biochemical targets in order to derive compounds that interact with a target is called rational drug design and offers 
great future potential. Thus, emerging drug discovery practices use molecular modeling techniques, combinatorial 
chemistry approaches, and other means to produce and screen and/or design large numbers of candidate compounds. 
10 Nevertheless, while this approach to drug discovery is clearly the way of the future, problems remain. For 

example, the initial step of identifying molecular targets for investigation can be an extremely time consuming task. It 
may also be difficult to design molecules that interact with the target by using computer modeling techniques. 
Furthermore, in cases where the function of the target is not known or is poorly understood, it may be difficult to 
design assays to detect molecules that interact with and alter the functions of the target. To improve the rate of 
15 novel drug discovery and development, methods of identifying important molecular targets in pathogenic 

microorganisms and methods for identifying molecules that interact with and alter the functions of such molecular 
targets are urgently required. . 

Escherichia coli represents an excellent model system to understand bacterial biochemistry and physiology. 
The estimated 4288 genes scattered along the 4.6 x 1 0 6 base pairs of the Escherichia coli (E coii) chromosome offer 
20 tremendous promise for the understanding of bacterial biochemical processes. In turn, this knowledge will assist in 

the development of new tools for the diagnosis and treatment of bacteria-caused human disease. The entire E coli 
genome has been sequenced, and this body of information holds a tremendous potential for application to the 
discovery and development of new antibiotic compounds. Yet, in spite of this accomplishment, the general functions 
or roles of many of these genes are still unknown. For example, the total number of proliferation-required genes 
25 contained within the E. coli genome is unknown, but has been variously estimated at around 200 to 700 (Armstrong, 

K.A. and Fan, D.P. Essential Genes in the metB-malB Region of Escherichia coli K\2, 1975, J. Bacterioi. 126: 48-55). 

Novel, safe and effective antimicrobial compounds are needed in view of the rapid rise of antibiotic resistant 
microorganisms. However, prior to this invention, the characterization of even a single bacterial gene was a 
painstaking process, requiring years of effort. Accordingly, there is an urgent need for more novel methods to identify 
30 . and characterize bacterial genomic sequences that encode gene products required for proliferation and for methods to 

identify molecules that interact with and alter the functions of such genes and gene products. 

SUMMARY OF THE INVENTION 
One embodiment of the present invention is a purified or isolated nucleic acid sequence consisting essentially 
of one of SEQ ID NOs: 1-81, 405-485, wherein said nucleic acid inhibits microorganism proliferation. The nucleic acid 
35 sequence may be complementary to at least a portion of a coding sequence of a gene whose expression is required for 
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microorganism proliferation. The nucleic acid sequence may comprise a fragment of one. of SEQ ID NQs. 1-81, 405- 
485, said fragment selected from the group consisting of fragments comprising at least 10, at least 20, at least 25, 
at least 30, at least 50 or more than 50 consecutive bases of one of SEQ ID NOs: 1-81, 405-485. The nucleic acid 
sequence may be complementary to a coding sequence of a gene whose expression is required for microorganism 
proliferation. 

Another embodiment of the present invention is a vector comprising a promoter operably linked to a nucleic 
acid comprising a sequence selected from the group consisting of SEQ ID NOs. 1-81, 405-485. The promoter may be 
active in an organism selected from the group consisting of Escherichia coli. Staphylococcus aureus, Pseudomonas 
aeruginosa, Enterobacter cloacae, Helicobacter pylori, Neisseria gonorrhoeae, Enterococcus faecaiis, Streptococcus 
pneumoniae, Haemophilus influenzae, Salmonella typhimurium, Saccharomyces cerevisiae, Candida albicans, 
Cryptococcus neoformans, Aspergillus fumigatus, Klebsiella pneumoniae. Salmonella typhi, Salmonella paratyphi, 
Salmonella cholerasuis, Staphylococcus epidermidis, Mycobacterium tuberculosis, Mycobacterium leprae, Treponema 
pallidum, Bacillus anthracis, Yersinia pestis, Clostridium botulinum, Campylobacter jejuni. Chlamydia trachomatus. 
Chlamydia pneumoniae or any species falling within the genera of any of the above species. 

Another embodiment of the present invention is a host cell containing the vectors described above. 

Another embodiment of the present invention is a purified or isolated nucleic acid consisting essentially of 
the coding sequence of one of SEQ ID NOs: 82-88, 90-242. One aspect of this embodiment is a fragment of the 
nucleic acid comprising at least 10, at least 20, at least 25, at least 30, at least 50 or more than 50 consecutive- 
bases of one of SEQ ID NQs: 82-88, 90-242. 

Another embodiment of the present invention isa vector comprising a promoter operably linked to the nucleic 
acids of the preceding embodiment. 

Another aspect of the present invention is a purified or isolated nucleic acid comprising a nucleic acid 
sequence complementary to at least a portion of an intragenic sequence, intergenic sequence, sequences spanning at 
least a portion of two or more genes, 5' noncoding region, or 3' noncoding region within an operon encoding a 
polypeptide comprising a sequence selected from the group consisting of SEQ ID NOs: 243-357, 359-398. 

Another embodiment of the present invention is a purified or isolated nucleic acid comprising a nucleic acid 
having at least 70% homology to a sequence selected from the group consisting of SEQ ID NOs 1-81, 405-485, 82- 
88, 90-242 or the sequences complementary thereto as determined using BLASTN version 2.0 with the default 
parameters. The nucleic acid may be from an organism selected from the group consisting of Staphylococcus aureus, 
Pseudomonas aeruginosa, Enterobacter cloacae, Helicobacter pylori, Neisseria gonorrhoeae, Enterococcus faecaiis, 
Streptococcus pneumoniae, Haemophilus influenzae, Salmonella typhimurium, Saccharomyces cerevisiae, Candida 
albicans, Cryptococcus neoformans, Aspergillus fumigatus, Klebsiella pneumoniae, Salmonella typhi, Salmonella 
paratyphi, Salmonella cholerasuis, Staphylococcus epidermidis, Mycobacterium tuberculosis, Mycobacterium leprae, 
Treponema pallidum, Bacillus anthracis, Yersinia pestis, Clostridium botulinum, Campylobacter jejuni, and Chlamydia 
trachomatus, Chlamydia pneumoniae or any species falling within the genera of any of the above species. 
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Another embodiment of the present invention is a purified or isolated nucleic acid consisting essentially of a 
nucleic acid encoding a polypeptide having a sequence selected from the group consisting of SEQ ID NOs.: 243-357, 
359-398. 

Another embodiment of the present invention is a vector comprising a promoter operably linked to a nucleic 
acid encoding a polypeptide having a sequence selected from the group consisting of SEQ ID NOs.: 243-357, 359-398. 

Another embodiment of the present invention is a host cell containing the vector of the preceding 
embodiment. 

Another embodiment of the present invention is purified or isolated polypeptide comprising the sequence of 
one of SEO ID NOs: 243-357, 359-398. 

Another embodiment of the present invention is purified or isolated polypeptide comprising a fragment of one 
of the polypeptides of SEQ ID NOs. 243-357, 359-398, said fragment selected from the group consisting of fragments 
comprising at least 5, at least 10, at least 20, at least 30, at least 40, at least 50, at least 60 or more than 60 
consecutive amino acids of one of the polypeptides of SEQ ID NOs.: 243-357, 359-398. 

Another embodiment of the present invention is an antibody capable of specifically binding the polypeptide of 
the preceding embodiment. 

Another embodiment of the present invention is method of producing a polypeptide, comprising introducing a 
vector comprising a promoter operably linked to a nucleic acid encoding a polypeptide having a sequence selected from 
the group consisting of SEQ ID NOs. 243-357, 359-398into a cell. The method may further comprise the step of " 
isolating said protein. 

Another embodiment of the present invention is a method of inhibiting proliferation comprising inhibiting the 
activity or reducing the amount of a polypeptide having a sequence selected from the group consisting of SEQ ID NOs. 
243-357, 359-398or inhibiting the activity or reducing the amount of a nucleic acid encoding said polypeptide. 

Another embodiment of the present invention is method for identifying compounds which influence the 
activity of a polypeptide required for proliferation comprising: 

contacting a polypeptide comprising a sequence selected from the group consisting of 243-357, 359- 
398with a candidate compound; and 

determining whether said compound influences the activity of said polypeptide. 

The activity may be an enzymatic activity. The activity may be a carbon compound cataboiism activity. The 
activity may be a biosynthetic activity. The activity may be a transporter activity. The activity may be a 
transcriptional activity. The activity may be a DNA replication activity. The activity may be a ceil division activity. 

Another embodiment of the present invention is a compound identified using the above method. 

Another embodiment of the present invention is method for assaying compounds for the ability to reduce the 
activity or level of a polypeptide required for proliferation, comprising: 

providing a target, wherein said target comprises the coding sequence of a sequence selected from the group 
consisting of SEQ ID NOs. 82-88, 90-242; 
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contacting Si^PParget with a candidate compound; and 
measuring an activity of said target. 

The target may be a messenger RNA molecule transcribed from a coding region of one of SEQ ID. NOs.: 82- 
88, 90-242 and said activity is translation of said messenger RNA. The target may be a coding region of one of SEQ 
5 ID. NOs. 82-88, 90-242 and said activity is transcription of said messenger RNA. 

Another embodiment of the present invention is a compound identified using the method above. 
Another embodiment of the present invention is a method for identifying compounds which reduce the 
activity or level of a gene product required for cell proliferation comprising the steps of: 

expressing an antisense nucleic acid against a nucleic acid encoding said gene product in a cell to reduce the 
10 activity or amount of said gene product in said ceil, thereby producing a sensitized cell; 

contacting said sensitized cell with a compound; and 

determining whether said compound inhibits the growth, of said sensitized cell to a greater extent than said 
compound inhibits the growth of a honsensitized cell. 

The cell may be selected from the group consisting of bacterial cells, fungal cells, plant cells, and animal 
15 cells. The cell may be an £ coll cell.- The cell may be from an organism selected from the group consisting of 

Staphylococcus aureus, Pseudomonas aeruginosa, Enterobacter cloacae, Helicobacter pylon) Neisseria gonorrhoeae, 
Enterococcus faecalis, Streptococcus pneumoniae, Haemophilus influenzae, Salmonella typhimurium, Saccharomyces 
cerevisiae, Candida albicans, Cryptococcus neoformans, Aspergillus fumigatus, Klebsiella pneumoniae, Salmonella 
typhi, Salmonella paratyphi, Salmonella cholerasuis, Staphylococcus epidermldls, Mycobacterium tuberculosis, 
20. Mycobacterium leprae, Treponema pallidum, Bacillus anthracis, Yersinia pestls, Clostridium botulinum, Campylobacter 

jejuni, and Chlamydia trachomatus, Chlamydia pneumoniae or any species falling within the genera of any of the above 
species. The antisense nucleic acid may be transcribed from an inducible promoter. The method may, further comprise 
the step of contacting said cell with a concentration of inducer which induces said antisense nucleic acid to a 
sublethal level. The sub-lethal concentration of said inducer may be such that growth inhibition is 8% or more. The 
25 inducer may be isopropyl-1 -thio p-D-galactoside. The growth inhibition may be measured by monitoring optical density 

of a culture growth solution. The gene product may be a polypeptide. The gene product may be an RNA. The gene 
product may comprise a polypeptide having a sequence selected from the group consisting of SEQ ID NOs.: 243-357, 
359-398. 

Another embodiment of the present invention is a compound identified using the method above. 
30 Another embodiment of the present invention is a method for inhibiting cellular proliferation comprising 

introducing a compound with activity against a gene corresponding to one of SEQ ID NOs.: 82*88, 90-242 or with 
activity against the product of said gene into a population of cells expressing a gene. The compound may be an 
antisense oligonucleotide comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-81, 405-485, 
or a proliferation-inhibiting portion thereof. The proliferation inhibiting portion of one of SEQ ID NOs. 1-81, 405-485 
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may be a fragmenmmprising at least 10, at least 20, at least 25, at least 30, at least 50 or more than 50 
consecutive bases of one of SEQ ID NOs: 1-81, 405485. The compound may be a triple helix oligonucleotide. 

Another embodiment of the present invention is a preparation comprising an effective concentration of an 
antisense oligonucleotide comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-81, 405-485, 
5 or a proliferation-inhibiting portion thereof in a pharmaceutical^ acceptable carrier. The proliferation-inhibiting portion 

of one of SEQ ID NOs. 1-81, 405-485 may comprise at least 10, at least 20, at least 25, at least 30, at least 50 or 
more than 50 consecutive bases of one of SEQ ID NOs: 1-81, 405-485. 

Another embodiment of the present invention is a method for inhibiting the expression of a gene in an operon 
required for proliferation comprising contacting a cell in a cell population with an antisense nucleic acid, said cell 
1 0 expressing a gene corresponding to one of SEQ ID NOs.: 82-88, 90-242, wherein said antisense nucleic acid comprises 

at least a proliferation-inhibiting portion of said operon in an antisense orientation that is effective in inhibiting 
expression of said gene. The antisense nucleic acid may be complementary to a sequence of a gene comprising one or 
more of SEQ ID NOs.: 82-88, 90-242. The antisense nucleic acid may be a sequence of one of SEQ ID NOs.: 1-81, 
405-485, or a portion thereof. The cell may be contacted with said antisense nucleic acid by introducing a plasmid 
15 which expresses said antisense nucleic acid into said cell population. The cell may be contacted with said antisense 

nucleic acid by introducing a phage which expresses said antisense nucleic acid into said cell population. The cell may 
be contacted with said antisense nucleic acid by introducing a sequence encoding said antisense nucleic acid into the 
chromosome of said cell into said cell population. The cell may be contacted with said antisense nucleic acid by 
introducing a retron which expresses said antisense nucleic acid into said cell population. The cell may be contacted 
20 with said antisense nucleic acid by introducing a ribozyme into said cell-population, wherein a binding portion of said 

ribozyme is complementary to said antisense oligonucleotide. The cell may be contacted with said antisense nucleic 
acid by introducing a liposome comprising said antisense oligonucleotide into said cell. The cell may be contacted with 
said antisense nucleic acid by electroporation. The antisense nucleic acid may be a fragment comprising at least 10, 
at least 20, at least 25, at least 30, at least 50 or more than 50 consecutive bases of one of SEQ ID NOs: 82-88, 90- 
25 242. The antisense nucleic acid may be an oligonucleotide. 

Another embodiment of the present invention is a method for identifying bacterial strains comprising the 

steps of: 

providing a sample containing a bacterial species; and 

identifying a bacterial species using a species specific probe having a sequence selected from the group 
30 consisting of SEQ ID NOs. 1-81, 405-485, 82-88, 90-242. 

Another embodiment of the present invention is a method for identifying a gene in a microorganism required 
for proliferation comprising: 

(a) identifying an inhibitory nucleic acid which inhibits the activity of a gene or gene product required for 
proliferation in a first microorganism; 
35 (b) contacting a second microorganism with said inhibitory nucleic acid; 
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c) determining^rotl 
second microorganism; and 



(c) determining^flther said inhibitory nucleic acid from said first microorganism inhibits proliferation of said 



(d) identifying the gene in said second microorganism which is inhibited by said inhibitory nucleic acid. 
Another embodiment of the present invention is a method for assaying a compound for the ability to inhibit 

5 proliferation of a microorganism comprising: 

(a) identifying a gene or gene product required for proliferation in a first microorganism; 

(b) identifying a homolog of said gene or gene product in a second microorganism; 

(c) identifying an inhibitory nucleic acid sequence which inhibits the activity of said homolog in said second 
microorgansim; 

10 (d) contacting said second microorganism with a proliferation-inhibiting amount of said inhibitory nucleic acid, 

thus sensitizing said second microorganism; 

(e) contacting the sensitized microorganism of step (d) with a compound; and 

(f) determining whether said compound inhibits proliferation of said sensitized microorganism to a greater 
extent than said compound inhibits proliferation of a nonsensitized microorganism. 

1 5 \ The step of identifying a gene involved in proliferation in a first microorganism may comprise: 

introducing a nucleic acid comprising a random genomic fragment from said first microorganism operably 
linked to a promoter wherein said random genomic fragment is in the antisense orientation; and 

comparing the proliferation of said first microorganism transcribing a first level of said random genomic 
fragment to the proliferation of said first microorganism transcribing a lower level of said random genomic fragment, 
20 wherein a difference in proliferation indicates that said random genomic fragment comprises a gene involved in 

proliferation. 

The step of identifying a homolog of said gene in a second microorganism may comprise identifying a 
homologous nucleic acid or a nucleic acid encoding a homologous polypeptide in a database using an algorithm 
selected from the group consisting of BLASTN version 2.0 with the default parameters and FASTA version 3.0t78 

25 algorithm with the default parameters. The step of identifying a homolog of said gene in a second microorganism may 

comprise identifying a homologous nucleic acid or a nucleic acid encoding a homologous polypeptide by identifying 
nucleic acids which hybridize to said first gene. The step of identifying a homolog of said gene in a second 
microorganism may comprise expressing a nucleic acid which inhibits the proliferation of said first microorganism in 
said second microorganism. The inhibitory nucleic acid may be an antisense nucleic acid. The inhibitory nucleic acid 

30 may comprise an antisense nucleic acid to a portion of said homolog. The inhibitory nucleic acid may comprise an 

antisense nucleic acid to a portion of the operon encoding said homolog. The step of contacting the second 
microorganism with a proliferation-inhibiting amount of said nucleic acid sequence may comprise directly contacting 
said second microorganism with said nucleic acid. The step of contacting the second microorganism with a 
proliferation-inhibiting amount of said nucleic acid sequence may comprise expressing an antisense nucleic acid to said 

35 homolog in said second microorganism. 
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Another eiMcliment of the present invention is a compound identified using the method above. 



Another embodiment of the present invention is a method of assaying a compound for the ability to inhibit 
proliferation comprising: 

(a) identifying an inhibitory nucleic acid sequence which inhibits the activity of a gene or gene product required 
5 for proliferation in a first microorgansim; 

(b) contacting a second microorganism with a proliferation-inhibiting amount of said inhibitory nucleic acid, thus 
sensitizing said second microorganism; 

(c) contacting the proliferation-inhibited microorganism of step (b) with a compound; and 

(d) determining .whether said compound inhibits proliferation of said sensitized second microorganism to a 
10 greater extent than said compound inhibits proliferation of a nonsensitized second microorganism. 

The inhibitory nucleic acid may be an amisense nucleic acid which inhibits the proliferation of said first 
microorganism. The inhibitory nucleic acid may comprise a portion of an amisense nucleic acid which inhibits the 
proliferation of said first microorganism. The inhibitory nucleic acid may comprise an amisense molecule against the 
entire coding region of the gene involved in proliferation of the first microorganism. The inhibitory nucleic acid may 
15 comprise an antisense nucleic acid to a portion of the operon encoding the gene involved in proliferation of the first 

microorganism. 

Another embodiment of the present invention is a compound identified using the method above. 
Another embodiment of the present invention is a method for assaying compounds for activity against a 
biological pathway required for proliferation comprising: 

sensitizing a cell by expressing an antisense nucleic acid against a nucleic acid encoding a gene product 
required for proliferation in a cell to reduce the activity or amount of said gene produce- 
contacting the sensitized cell with a compound; and 

determining whether said compound inhibits the growth of said sensitized cell to a greater extent than said 
compound inhibits the growth of an nonsensitized cell. 

25 The ce " mav be selected from the group consisting of bacterial cells, fungal cells, plant cells, and animal 

cells. The cell may be an £ coli cell. The cell may be an organism selected from the group consisting of 
Staphylococcus aureus, Pseudomonas aeruginosa, Enterobacter cloacae, Helicobacter pylori. Neisseria gonorrhoeae, 
Enterococcus faecalis, Streptococcus pneumoniae, Haemophilus influenzae. Salmonella typhimurium, Saccharomyces 
cerevisiae. Candida albicans, Cryptococcus neoformans, Aspergillus fumigatus, Klebsiella pneumoniae. Salmonella 

30 typhi. Salmonella paratyphi. Salmonella cholerasuis, Staphylococcus epidermidis, Mycobacterium tuberculosis, 

Mycobacterium leprae, Treponema pallidum. Bacillus anthracis, Yersinia pestis, Clostridium botulinum, Campylobacter 
jejuni, and Chlamydia trachoma tus, Chlamydia pneumoniae or any species falling within the genera of any of the above 
species. The antisense nucleic acid may be transcribed from an inducible promoter. The method may further comprise 
contacting the cell with an agent which induces expression of said antisense nucleic acid from said inducible promoter, • 

35 wherein said antisense nucleic acid is expressed at a sublethal leveL The sublethal level of said antisense nucleic acid 
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may inhibit proIiferatir^^8% or more. The agent may be isopropyM-thio-p-D-gSctoside (IPTG). The inhibition of 
proliferation may be measured by monitoring the optica! density of a liquid culture. The gene product may comprise a 
polypeptide having a sequence selected from the group consisting of SEQ ID NOs: 243-357, 359-398. 

Another embodiment of the present invention is a compound identified using the method above. 
. 5 Another embodiment of the present invention is a method for assaying a compound for the ability to inhibit 

cellular proliferation comprising: 

contacting a cell with an agent which reduces the activity or level of a gene product required for 
proliferation of said cell- 
contacting said cell with said compound; and 
10 determining whether said compound reduces proliferation to a greater extent than said compound reduces 

proliferation of cells which have not been contacted with said agent. 

The agent which reduces the activity or level of a gene product required for proliferation of said cell may 
comprise an antisense nucleic acid to a gene or operon required for proliferation. The agent which reduces the activity 
or level of a gene product required for proliferation of said cell may comprise an antibiotic. The cell may contain a 
15 temperature sensitive mutation which reduces the activity or level of said gene product required for proliferation of 

said cell. The antisense nucleic acid may be directed against the same functional domain of said gene product required 
for proliferation of said cell to which said antisense nucleic acid is directed. The antisense nucleic acid may be directed 
against a different functional domain of said gene product required for proliferation of said cell than the fucritional 
. domain to which said antisense nucleic acid is directed. 
20 Another embodiment of the present invention is a compound identified using the method above. 

Another embodiment of the present invention is a method for identifying the pathway in which a 
proliferation-required nucleic acid or its gene product lies comprising: 

expressing a sublethal level of an antisense nucleic acid directed against said proliferation-required nucleic 
acid in a cell; 

25 contacting said cell with an antibiotic, wherein the a biological pathway on which said antibiotic acts is 

known; and 

determining whether said cell has a substantially greater sensitivity to said antibiotic than a cell which does 
not express said sublethal level of said antisense nucleic acid. 

Another embodiment of the present invention is a method for determining the pathway on which a test 
30 compound acts comprising: 

(a) expressing a sublethal level of an antisense nucleic acid directed against a proliferation-required nucleic 
acid in a cell wherein the biological pathway in which said proliferation-required nucleic acid lies is known, 

(b) contacting said cell with said test compound; and 

(c) determining whether said cell has a substantially greater sensitivity to said test compound than a cell 
35 which does not express said sublethal level of said antisense nucleic acid. 
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The method may further comprise: 

(d) expressing a sublethal level of a second antisense nucleic acid directed against a second proliferation-required 
nucleic acid in said cell, wherein said second proliferation-required nucleic acid is in a different biological pathway than 
said proliferation-required nucleic acid in step (a); and 

(e) determining whether said cell has a substantially greater sensitivity to said test compound than a cell which does 
not express said sublethal level of said second antisense nucleic acid. 

Another embodiment of the present invention is a purified or isolated nucleic acid consisting essentially of 
one of SEQ ID NOs: 358, 399-402. 

Another embodiment of the present invention is a purified or isolated nucleic acid comprising a sequence 
selected from the group consisting of 1-81, 405-485, 82-88, 90-242, 358, 399*402. 

Another embodiment of the present invention is a compound which interacts with the gene or gene product 
of a nucleic acid comprising a sequence of one of SEQ ID NOs: 82-88, 90-242 to inhibit proliferation. 

Another embodiment of the present invention compound which interacts with a polypeptide comprising one 
of SECT ID NOs. 243-357, 359-398 to inhibit proliferation. 

Another embodiment of the present invention is a compound which interacts with a nucleic acid comprising 
one of SEQ ID NOs: 358, 399-402 to inhibit proliferation. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 is an IPTG dose response curve in £ coli transformed with an IPTG-inducible plasmid 
containing either an antisense clone to the £ coli ribosomal protein rplW (AS-rp!W) which is required for protein 
synthesis and essential cell proliferation, or an antisense clone to the elaD (AS-elaD) gene which is not known to be 
involved in protein synthesis and which is also essential for proliferation. 

Figure 2A is a tetracycline dose response curve in £ coli transformed with an IPTG-inducible 
plasmid containing antisense to rplW(AS-rplW) in the presence of 0, 20 or 50 jiM IPTG. 

Figure 2B is a tetracycline dose response curve in £. coli transformed with an IPTG-inducible 
plasmid containing antisense to elaD (AS-elaD) in the presence of 0, 20 or 50 yM IPTG. 

Figure 3 is a graph showing the fold increase in tetracycline sensitivity of £ coli transfected with 
antisense clones to essential ribosomal proteins L23 (AS-rplW) and L7/L12 and L10 (AS-rplLrplJ). Antisense clones to 
genes known not to be involved in protein synthesis (atpB/E(AS-atpB/E ), visC (AS-visC, elaD (AS-elaD), yohH (AS- 
yohH) are much less sensitive to tetracycline. 

Definitions 

By "biological pathway" is meant any discrete cell function or process that is carried out by a gene product 
or a subset of gene products. Biological pathways include enzymatic, biochemical and metabolic pathways as well as 
pathways involved in the production of cellular structures such cell walls. Biological pathways that are usually 
required for proliferation of microorganisms include, but are not limited to, cell division, DNA synthesis & replication, 
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RNA symhesis (Inscription), protein synthesis (translation), protein processing, protein transport, fatty acid 
biosynthesis, cell wall synthesis, cell membrane synthesis & maintenance, etc. 

By "inhibit activity against a gene or gene product" is meant having the ability to interfere with the function 
of a gene or gene product in such a way as to decrease expression of the gene or to reduce the level or activity of a 
product of the gene. Agents which have activity against a gene include agents that inhibit transcription of the gene 
and agents that inhibit translation of the mRNA transcribed from the gene. In microorganisms, agents which have 
activity against a gene can act to decrease expression of the operon in which the gene resides or alter the processing 
of operon RNA such as to reduce the level or activity of the gene product. The gene product can be a non-translated 
RNA such as ribosomal RNA, a translated RNA (mRNA) or the protein product resulting from translation of the gene 
mRNA. Of particular utility to the present invention are anti-sense RNAs that have activities against the operons or 
genes to which they specifically hybridze. 

By "activity against a gene product" is meant having the ability to inhibit the function or to reduce the level 
or activity of the gene product in a cell. 

By "activity against a protein" is meant having the ability to inhibit the function or to reduce the level or 
activity of the protein in a cell. 

By "activity against nucleic acid" is meant having the ability to inhibit the function or to reduce the level or 
activity of the nucleic acid in a cell. 

As used herein, "sublethal" means a concentration of an agent below the concentration required to inhibit all 
cell growth. 

DETAILED DESCRIPTION DF THE INVENTION 

The present invention describes a group of £ coli genes and gene families required for growth and/or 
proliferation. A proliferation-required gene or gene family is one where, in the absence of a gene transcript and/or gene 
product, growth or viability of the microorganism is reduced or eliminated. Thus, as used herein the terminology 
* "proliferation-required" or "required for proliferation" encompasses sequences where the absence of a gene transcript 
and/or gene product completely eliminates cell growth as well as sequences where the absence of a gene transcript and/or 
gene product merely reduces cell growth. These proliferation-required genes can be used as potential targets for the 
generation of new antimicrobial agents. To achieve that goal, the present invention also encompasses novel assays for 
analyzing proliferation-required genes and for identifying compounds which interact with the gene products of the 
proliferation-required genes. In addition, the present invention contemplates the expression of genes and the purification of 
the proteins encoded by the nucleic acid sequences identified as required proliferation genes and reported herein. The 
purified proteins can be used to generate reagents and screen small molecule libraries or other candidate compound libraries 
for compounds that can be further developed to yield novel antimicrobial compounds. The present invention also describes 
methods for identification of homologous genes in organisms other than £ coli. 

The present invention utilizes a novel method to identify proliferation-required £ coli sequences. Generally, a 
library of nucleic acid sequences from a given source are subcioned or otherwise inserted into an inducible expression 
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vector, thus forminffn expression library. Although the insert nucleic acids may be derived from the chromosome of the 
organism into which the expression vector is to be introduced, because the insert is not in its natural chromosomal location, 
the insert nucleic acid is an exogenous nucleic acid for the purposes of the discussion herein. The term expression is 
defined as the production of an RNA molecule from a gene, gene fragment, genomic fragment, or operon. Expression can 
5 also be used to refer to the process of peptide or polypeptide synthesis. An expression vector is defined as a vehicle by 

which a ribonucleic acid (RNA) sequence is transcribed from a nucleic acid sequence carried within the expression vehicle. 
The expression vector can also contain features that permit translation of a protein product from the transcribed RNA 
message expressed from the exogenous nucleic acid sequence carried by the expression vector. Accordingly, an expression 
vector can produce an RNA molecule as its sole product or the expression, vector can produce a RNA molecule that is 
10 ultimately translated into a protein product. 

Once generated, the expression library containing the exogenous nucleic acid sequences is introduced into an £ 
coli population to search for genes that are required for bacterial proliferation. Because the library molecules are foreign to 
the population of £ coli. the expression vectors and the nucleic acid segments contained therein are considered exogenous 
nucleic acid. 

Expression of the exogenous nucleic acid fragments in the test population of £ coli containing the expression 
vector library is then activated. Activation of the expression vectors consists of subjecting the cells containing the vectors 
to conditions that result in the expression of the exogenous nucleic acid sequences carried by the expression vector library. 
The test population of £ coli ceils is then assayed to determine the effect of expressing the exogenous nucleic acid 
fragments on the test population of cells. Those expression vectors that, upon activation and expression, negatively 
20 impact the growth of the £ coli screen population were identified, isolated, and purified for further study. 

A variety of assays are contemplated to identify nucleic acid sequences that negatively impact growth upon 
expression. In one embodiment, growth in £ coli cultures expressing exogenous nucleic acid sequences and growth in 
cultures not expressing these sequences is compared. Growth measurements are assayed by examining the extent of 
growth by measuring optical densities. Alternatively, enzymatic assays can be used to measure bacterial growth rates 
to identify exogenous nucleic acid sequences of interest. Colony size, colony morphology, and cell morphology are 
additional factors used to evaluate growth of the host cells. Those cultures that failed to grow or grow with reduced 
efficiency under expression conditions are identified as containing an expression vector encoding a nucleic acid fragment 
that negatively affects a proliferation-required gene. 

Once exogenous nucleic acid sequences of interest are identified, they are analyzed. The first step of the 
30 analysis is to acquire the nucleic acid sequence of the nucleic acid fragment of interest. To achieve this end, the insert in 

those expression vectors identified as containing a sequence of interest is sequenced, using standard techniques well 
known in the art. The next step of the process is to determine the source of the nucleic acid sequence. 

Determination of sequence source is achieved by comparing the obtained sequence data with known sequences 
in various genetic databases. The sequences identified are used to probe these gene databases. The result of this 
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procedure is a list ofWenous nucleic acid sequences corresponding to a list thaSudeds novel bacterial genes required 
for proliferation. as well as genes previously identified as required for proliferation. 

The number of DNA and protein sequences available in database systems has been growing exponentially for 
years. For example, at the end of 1 998, the complete sequences of Caenorhabditis elegans, Saccharomyces cerevisiae and 
nineteen bacterial genomes, including £ coli were available. This sequence information is stored in a number of databanks, 
such as GenBank (the National Center for Biotechnology Information (NCBI), and is publicly available for searching. 

A variety of computer programs are available to assist in the analysis of the sequences stored within these 
databases. FastA, (W. R. Pearson (1990) "Rapid and Sensitive Sequence Comparison with FASTP and FASTA H 
Methods in Enzymology 183:63- 98), Sequence Retrieval System (SRS), (Etzold & Argos, SRS an indexing and 
retrieval tool for flat file data libraries. Comput. AppL Biosci. 9:49-57, 1993) are two examples of computer programs 
that can be used to analyze sequences of interest. In one embodiment of the present invention, the BLAST family of 
computer programs, which includes BLASTW version 2.0 with the default parameters, or BLASTX version 2.0 with the 
default parameters, is used to analyze nucleic acid sequences. 

BLAST, an acronym for "Basic Local Alignment Search Tool," is a family of programs for database similarity 
searching. The BLAST family of programs includes: BLASTN, a nucleotide sequence database searching program, 
BLASTX, a protein database searching program where the input is a nucleic acid sequence;, and BLASTP, a protein 
database searching program. BLAST programs embody a fast algorithm for sequence matching, rigorous statistical 
methods for judging the significance of matches, and various options for tailoring the program for special situations. 
Assistance in using the program can be obtained by e-mail at blastfancbi.nlm.nih.Qov. 

Bacterial genes are often transcribed in polycistronic groups. These groups comprise operons, which are a 
collection of genes and intergentc sequences. The genes of an operon are co-transcribed and are often related functionally. 
Given the nature of the screening protocol, it is possible that the identified exogenous nucleic acid sequence corresponds to 
a gene or portion thereof with or without adjacent noncoding sequences, an intragenic sequence (i.e. a sequence within a 
gene), an intergenic sequence (i.e. a sequence between genes), a sequence spanning at least a portion of two or more 
genes, a 5' noncoding region or a 3' noncoding region located upstream or downstream from the actual sequence that is 
required for bacterial proliferation. Accordingly/ determining which of the genes that are encoded within the operons are 
individually required for proliferation is often desirable. 

In one embodiment of the present invention, an operon is dissected to determine which gene or genes are required 
for proliferation. For example, the RegulonDB DataBase described by Huerta et al. \NucL Acids Res. 26:55-59, 1998), 
which may also be found on the website http://www.cifn.unam.mx/Computational_Biology/regulondb/, may be used, to 
identify the boundaries of operons encoded within microbial genomes. A number of techniques that are well known in the 
art can be used to dissect the operon. In one aspect of this embodiment, gene disruption by homologous recombination is 
used to individually inactivate the genes of an operon that is thought to contain a gene required for proliferation. 

Several gene disruption techniques have been described for the replacement of a functional gene with a 
mutated, non-functional (null) allele. These techniques generally involve the use of homologous recombination. The 
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method described bW et al. (J. Bacteriol 1997 179:6228; incorporated herein by reference in it's entirety) serves 
as an excellent example of these methods as applicable to disruption of genes in £. coli. This technique uses crossover 
PCR to create a null allele with an in-frame deletion of the coding region of a target gene. The null allele is 
constructed in such a way that sequences adjacent to the wild type gene (ca. 500 bp) are retained. These 
homologous sequences surrounding the deletion null allele provide targets for homologous recombination so that the 
wild type gene on the £ caff chromosome can be replaced by the constructed null allele. 

The crossover PCR amplification product is subcloned into the vector pK03, the features of which include a 
chloramphenicol resistance gene, the counter-selectable marker sacB, and a temperature sensitive autonomous 
replication function. Following transformation of an £ a?// cell population with such a vector, selection for cells that 
have undergone homologous recombination of the vector into the chromosome is achieved by growth on 
chloramphenicol at the non-permissive temperature of 43°C. Under these conditions, autonomous replication of the 
plasmid cannot occur and cell are resistant to chloramphinicol only if the chloramphenicol resistance gene has been 
integrated into the chromosome. Usually a single crossover event is responsible for this integration event such that 
the £ coli chromosome now contains a tandem duplication of the target gene consisting of one wild type allele and 
15 one deletion null allele separated by vector sequence. 

This new £ coli strain containing the tandem duplication can be maintained at permissive temperatures in 
the presence of drug selection (chloramphenicol). Subsequently, cells of this new strain are cultured at the permissive 
temperature 30°C without drug selection. Under these conditions, the chromosome of some of the cells within the 
population will have undergone an internal homologous recombination event resulting in removal of the plasmid 
sequences. Subsequent culturing of the strain in growth medium lacking chloramphenicol but containing sucrose is 
used to select for such recombinative resolutions. In the presence of the counter-selectable marker sacB, sucrose is 
rendered into a toxic metabolite. Thus, cells that survive this counter-selection have lost both the plasmid sequences 
from the chromosome and the autonomously replicating plasmid that results as a byproduct of recombinative 
resolution. 

25 There are two P° ssible outcomes of the above recombinative resolution via homologous recombination. 

Either the wild type copy of the targeted gene is retained on the chromosome or the mutated null allele is retained on 
the chromosome. In the case of an essential gene, a single copy of the null allele would be lethal and such cells should 
not be obtained by the above procedure when applied to essential genes. In the case of a non essential gene, roughly 
equal numbers of cells containing null alleles and cells containing wild type alleles should be obtained. Thus, the 

30 method serves as a test for essentiality of the targeted gene: when applied to essential genes, only cells with a wild 

type allele on the chromosome will be obtained. 

Other techniques have also been described for the creation of disruption mutations in £ coli. For example, 
Link et al. also describe inserting an in-frame sequence tag concommitantly with an in-frame deletion in order to 
simplify analysis of recombinants obtained. Further, Link et al. describe disruption of genes with a drug resistance 

35 marker such as a kanamycin resistance gene. Arigoni et al., (Arigoni, F. et al. A Genome-based Approach for the 
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Identification of EssW Bacterial Genes. Nature Biotechnology 16: 851-856, disclosure of which is incorporated 
herein by reference in its entirety) describe the use of gene disruption combined with engineering a second copy of a 
test gene such that the expression of the gene is regulated by and inducible promoter such as the arabinose promoter 
to test the essentiality of the gene. Many of these techniques result in the insertion of large fragments of ONA into 
the gene of interest, such as a drug selection marker. An advantage of the technique described by Link et al. is that it 
does not rely on an insertion into the gene to cause a functional defect, but rather results in the precise removal of the 
coding region. This insures the lack of polar effects on the expression of genes downstream from the target gene. 

Recombinant DNA techniques can be used to express the entire coding sequences of the gene identified as 
required for proliferation, or portions thereof. The over-expressed proteins can be used as reagents for further study. The 
identified exogenous sequences are isolated, purified, and cloned into a suitable expression vector using methods well 
known in the art. If desired, the nucleic acids can contain the sequences encoding a signal peptide to facilitate secretion of 
the expressed protein. 

Expression of fragments of the bacterial genes identified as required for proliferation is also contemplated by the 
present invention: The fragments of the identified genes can encode a polypeptide comprising at least 5, at least 10, at 
least 1 5, at least 20, at least 25, at least 30, at least 35. at least 40, at least 45, at least 50, at least 55, at least 60, at 
least 65, at least 75, or more than 75 consecutive amino acids of a gene complementary to one of the identified sequences 
of the present invention. The nucleic acids inserted into the expression vectors can also contain sequences upstream and 
downstream of the coding sequence. 

When expressing the coding sequence of an entire gene identified as required for bacterial proliferation or a 
fragment thereof, the nucleic acid sequence to be expressed is operably linked to a promoter in an expression vector using 
conventional cloning technology. The expression vector can be any of the bacterial, insect, yeast, or mammalian expression 
systems known in the art. Commercially available vectors and expression systems are available from a variety of suppliers 
including Genetics Institute (Cambridge, MA), Stratagene (La Jolla, California), Promega (Madison, Wisconsin), and 
Invitrogen (San Diego, California). If desired, to enhance expression and facilitate proper protein folding, the codon usage 
and codon bias of the sequence. can be optimized for the particular expression organism in which the expression vector is 
introduced, as explained by Hatfield, et al., U.S: Patent No. 5,082,767, incorporated herein by this reference. Fusion 
protein expression systems are also contemplated by the present invention. 

Following expression of the protein encoded by the identified exogenous nucleic acid sequence, the protein is 
purified. Protein purification techniques are well known in the art. Proteins encoded and expressed from identified 
exogenous nucleic acid sequences can be partially purified using precipitation techniques, such as precipitation with 
polyethylene glycol. Chromatographic methods usable with the present invention can include ion-exchange 
chromatography, gel filtration, use of hydroxyapaptite columns, immobilized reactive dyes, chromatofocusing, and use of 
high-performance liquid chromatography. Electrophoretic methods such one-dimensional gel electrophoresis, high-resolution 
two-dimensional polyacrylamide electrophoresis, isoelectric focusing, and others are contemplated as purification methods. 
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Also, affinity chronWgraphic methods, comprising antibody columns, ligand presenting columns and other affinity 
chromatographic matrices are contemplated as purification methods in the present invention. 

The purified proteins produced from the gene coding sequences identified as required for proliferation can be used 
in a variety of protocols to generate useful antimicrobial reagents. In one embodiment of the present invention, antibodies 
are generated against the proteins expressed from the identified exogenous nucleic acid sequences. Both monoclonal and 
polyclonal antibodies can be generated against the expressed proteins. Methods for generating monoclonal and polyclonal 
antibodies are well known in the art. Also, antibody fragment preparations prepared from the produced antibodies 
discussed above are contemplated. 

Another application for the purified proteins of the present invention is to screen small molecule libraries for 
candidate compounds active against the various target proteins of the present invention. Advances in the field of 
combinatorial chemistry provide methods, well known in the art, to produce large numbers of candidate compounds that 
can have a binding, or otherwise inhibitory effect on a target protein. Accordingly, the screening of small molecule libraries 
for compounds with binding affinity or inhibitory activity for a target protein produced from an identified gene sequence is 
contemplated by the present invention. 

The present invention further contemplates utility against a variety of other pathogenic organisms in addition to 
£ coli. For example, the invention has utility in identifying genes required for proliferation in prokaryotes and eukaryotes. 
For example, the invention has utility with protists, such as Plasmodium spp.; plants; animals, such as Entamoeba spp. 
and Contracaecum spp; and fungi including Candida spp., (e.g., Candida albicans), Saccharomyces cerevisiae, 
Cryptococcus neoformans, and Aspergillus fumigatus. In one embodiment of the present invention, monera, specifically 
bacteria are probed in search of novel gene sequences required for proliferation. This embodiment is particularly important 
given the rise of drug resistant bacteria. 

The numbers of bacterial species that are becoming resistant to existing antibiotics are growing. A partial list of 
these organisms includes: Staphylococcus spp., such as S. aureus; Enterococcus spp., such as £ faecalis; 
Pseudomonas spp., such as P. aeruginosa, Clostridium spp., such as C. botulinum, Haemophilus spp., such as H. 
influenzae, Enterobacter spp., such as £ cloacae, Vibrio spp., such as V. cholera; Moraxala spp., such as M. 
catarrhal; Streptococcus spp., such as S. pneumoniae, Neisseria spp., such as N. gonorrhoeae; Mycoplasma spp., 
such as Mycoplasma pneumoniae; Salmonella typhimurium; Helicobacter pylori; Escherichia coli; and Mycobacterium 
tuberculosis. The sequences identified as required for proliferation in the present invention can be used to probe these 
and other organisms to identify homologous required proliferation genes contained therein. 

In one embodiment of the present invention, the nucleic acid sequences disclosed herein are used to screen 
genomic libraries generated from bacterial species of interest other than £ coli. For example, the genomic library may be 
from Staphylococcus aureus, Pseudomonas aeruginosa, Enterobacter cloacae, Helicobacter pylori. Neisseria 
gonorrhoeae, Enterococcus faecalis, Streptococcus pneumoniae, Haemophilus influenzae. Salmonella typhimurium, 
Saccharomyces cerevisiae, Candida albicans, Cryptococcus neoformans, Aspergillus fumigatus, Klebsiella pneumoniae. 
Salmonella typhi. Salmonella paratyphi. Salmonella cholerasuis, Staphylococcus epidermidis, Mycobacterium 
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tuberculosis, Myco^fium leprae, Treponema pallidum. Bacillus anthracis, ^Knia pestis, Clostridium botulinum, 
Campylobacter jejuni, Chlamydia trachomatus, Chlamydia pneumoniae or any species falling within the genera of any 
of the above species. Standard molecular biology techniques are used to generate genomic libraries from various 
microorganisms. In one aspect, the libraries are generated and bound to nitrocellulose paper. The identified exogenous 
5 nucleic acid sequences of the present invention can then be used as probes to screen the libraries for homologous 

sequences. The homologous sequences identified can then be used as targets for the identification of new, antimicrobial 
compounds with activity against more than one organism. 

For example, the preceding methods may be used to isolate nucleic acids having a sequence with at least 
97%,-at least 95%, at least 90%, at least 85%, at least 80%, or at least 70% identity to a nucleic acid sequence 
10 selected from the group consisting of one of the sequences of SEQ ID NOS. 1-81, 405-485, 82-88, 90-242, fragments 

comprising at least 10, 15, 20, 25, 30. 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 consecutive bases thereof, and- 
the sequences complementary thereto. Identity may be measured using BLASTN version 2.0 with the default 
parameters. (Altschul, S.F. et al. Gapped BLAST and PSI-BLAST: A New Generation of Protein Database Search 
Programs, Nucleic Acid Res. 25: 3389-3402 (1997), the disclosure of which is incorporated herein by reference in its 
15 entirety). For example, the homologous polynucleotides may have a coding sequence which is a naturally occurring 

allelic variant of one of the coding sequences described herein. Such allelic variants may have a substitution, deletion 
or addition of one or more nucleotides when compared to the nucleic acids of SEQ ID NOs: 1-81, 405-485, 82-88, 90- 
242 or the sequences complementary thereto. 

Additionally, the above procedures may be used to isolate nucleic acids which encode polypeptides having at 
20 least 99%, 95%, at least 90%, at least 85%, at least 80%, at least 70%, at least 60%, at least 50%, or at least 40% 

identity or similarity to a polypeptide having the sequence of one of SEQ ID NOs: 243-357, 359-398or fragments 
comprising at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids thereof as determined 
using the FASTA version 3.0t78 algorithm with the default parameters. Alternatively, protein identity or similarity 
may be identified using BLASTP with the default parameters, BLASTX with the default parameters, or TBLASTN with 
25 the default parameters. (Alschul, S.F. et al. Gapped BLAST and PSI-BLAST: A New Generation of Protein Database 

Search Programs, Nucleic Acid Res. 25: 3389-3402 (1997), the disclosure of which is incorporated herein by 
reference in its entirety). 

Alternatively, homologous nucleic acids or polypeptides may be identified by searching a database to identify 
sequences having a desired level of homology to a nucleic acid or polypeptide involved in proliferation or an antisense 
30 nucleic acid to a nucleic acid involved in microbial proliferation. A variety of such databases are available to those 

skilled in the art, including GenBank and GenSeq. In some embodiments, the databases are screened to identify 
nucleic acids or polypeptides having at least 97%, at least 95%, at least 90%, at least 85%, at least 80%, at least 
70%, at least 60%, or at least 50%, at least 40% identity or similarity to a nucleic acid or polypeptide involved in 
proliferation or an antisense nucleic acid involved in proliferation. For example, the database may be screened to 
identify nucleic acids homologous to one of SEQ ID Nos. 1-81, 405-485, 82-88, 90-242 or polypeptides homologous 
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to SEQ ID NOs. 243W. 359-398. In some embodiments, the database may be screened to identify homologous 
nucleic acids or polypeptides from organisms other than £ coli, including organisms such as Staphylococcus aureus, 
Pseudomonas aeruginosa, Enterobacter cloacae, Helicobacter pylori, Neisseria gonorrhoeae, Enterococcus faecalis. 
Streptococcus pneumoniae, Haemophilus influenzae, Salmonella typhimurium, Saccharomyces cerevisiae, Candida 
5 albicans, Cryptococcus neoformans, Aspergillus fumigatus, Klebsiella pneumoniae. Salmonella typhi. Salmonella 

paratyphi, Salmonella cholerasuis. Staphylococcus epidermidis, Mycobacterium tuberculosis, Mycobacterium leprae, 
Treponema pallidum. Bacillus anthracis. Yersinia pestis, Clostridium botulinum, Campylobacter jejuni,Chlamydia 
trachomatus. Chlamydia pneumoniae or any species falling within the genera of any of the above species. 

In another embodiment, gene expression arrays and microarrays can be employed. Gene expression arrays 
10 are high density arrays of DMA samples deposited at specific locations on a glass chip, nylon membrane, or the like. 

Such arrays can be used by researchers to quantify relative gene expression under different conditions. Gene 
expression arrays are used by researchers to help identify optimal drug targets, profile new compounds, and determine 
disease pathways. An example of this technology is found in U.S. Patent No. 5807522, which is hereby incorporated 
by reference. 

15 11 is P° ssible t0 stud V tne expression of ail genes in the genome of a particular microbial organism using a 

single array. For example, the arrays from Genosys consist of 12 x 24 cm nylon filters containing PCR products 
corresponding to 4290 ORFs from £ coli. 10 ngs of each are spotted every 1.5 mm on the filter. Single stranded 
labeled cDNAs are prepared for hybridization to the array (no second strand synthesis or amplification step is done) 
and placed in contact with the filter. Thus the labeled cDNAs are of "antisense" orientation. Quantitative analysis is 

20 done by phosphorimager. 

Hybridization of cDNA made from a sample of total cell mRNA to such an array followed by detection of 
binding by one or more of various techniques known to those in the art results in a signal at each location on the array 
to which cDNA hybridized. The intensity of the hybridization signal obtained at each location in the array thus reflects 
the amount of mRNA for that specific gene that was present in the sample. Comparing the results obtained for mRNA 

25 isolated from cells grown under different conditions thus allows for a comparison of the relative amount of expression 

of each individual gene during growth under the different conditions. 

Gene expression arrays may be used to analyze the total mRNA expression pattern at various time points 
after induction of an antisense nucleic acid against a proliferation-required gene. Analysis of the expression pattern 
indicated by hybridization to the array provides information on whether or not the target gene of the antisense nucleic 

30 acid is being affected by antisense induction, how quickly the antisense is affecting the target gene, and for later 

timepoints, what other genes are affected by antisense expression. For example, if the antisense is directed against a 
gene for ribosomal protein L7/L12 in the 50S subunit. its targeted mRNA may disappear first and then other mRNAs 
may be observed to increase, decrease or stay the same. Similarly, if the antisense is directed against a different 50S 
subunit ribosomal protein mRNA (e.g. L25), that mRNA may disappear first followed by changes in mRNA expression 

35 that are similar to those seen with the L7/L12 antisense expression. Thus, the mRNA expression pattern observed 
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with an antinsense^Rc acid against a proliferation required gene may iden^ryother proliferation-required nucleic 
acids in the same pathway as the target of the antisense nucleic acid. In addition, the mRNA expression patterns 
observed with candidate drug compounds may be compared to those observed with antisense nucleic acids against a 
proliferation-required nucleic acid. If the mRNA expression pattern observed with the candidate drug compound is 
similar to that observed with the antisense nucleic acid, the drug compound may be a promising therapeutic candidate. 
Thus, the assay would be useful in assisting in the selection of candidate drug compounds for use in screening 
methods such as those described below. 

In cases where the source of nucleic acid deposited on the array and the source of the nucleic acid being 
hybridized to the array are from two different organisms, gene expression arrays can identify homologous genes in the 
two organisms. 

The present invention also contemplates additional methods for screening other microorganisms for proliferation- 
required genes. In this embodiment, the conserved portions of sequences identified as proliferation-required can be used to 
generate degenerate primers for use in the polymerase chain reaction (PCR). The PGR technique is well known in the art. 
The successful production of a PCR product using degenerate probes generated from the sequences identified herein would 
indicate the presence of a homologous gene sequence in the species being screened. This homologous gene is then 
isolated, expressed, and used as a target for candidate antibiotic compounds. In another aspect of this embodiment, the 
homologous gene is expressed in an autologous organism or in a heterologous organism in such a way as to alter the level 
or activity of a homologous gene required for proliferation in the autologous or heterologus organism. In still another aspect 
of this embodiment, the homologous gene or portion is expressed in an antisense orientation in such a way as to alter the 
level or activity of a nucleic acid required for proliferation of an autologous or heterologous organism. 

The homologous sequences to proliferation-required genes identified using the techniques described herein 
may be used to identify proliferation-required genes of organisms other than £ coff; to inhibit the proliferation of 
organisms other than £ coff by inhibiting the activity or reducing the amount of the identified homologous nucleic acid 
or polypeptide in the organism other than £ coli, or to identify compounds which inhibit the growth of organisms other 
than E. coli as described below. 

In another embodiment of the present invention, £ a?// sequences identified as required for proliferation are 
transferred to expression vectors capable of function within non-£ coli species. As would be appreciated by one of 
ordinary skill in the art, expression vectors must contain certain elements that are species specific. These elements 
can include promoter sequences, operator sequences, repressor genes, origins of replication, ribosomal binding 
sequences, termination sequences, and others. To use the identified exogenous sequences of the present invention, 
one of ordinary skill in the art would know to use standard molecular biology techniques to isolate vectors containing 
the sequences of interest from cultured bacterial cells, isolate and purify those sequences, and subclone those 
sequences into an expression vector adapted for use in the species of bacteria to be screened. 

Expression vectors for a variety of other species are known in the art. For example, Cao et al. report the 
expression of steroid receptor fragments jn Staphylococcus aureus. J. Steroid Biochem Mol Biol. 44(l) : Mi 
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(1993). Also. Pla eflThave reported an expression vector that is functional in a number of relevant hosts including: 
Salmonella typhimurium, Pseudomonas putida, and Pseudomonas aeruginosa. J. Bacteriol. 172(8):4448-55 (1990). 
These examples demonstrate the existence of molecular biology techniques capable of constructing expression vectors 
for the species of bacteria of interest to the present invention. 

Following the subcioning of the identified nucleic acid sequences into an expression vector functional in the 
microorganism of interest, the identified nucleic acid sequences are conditionally transcribed to assay for bacterial 
growth inhibition. Those expression vectors found to contain sequences that, when transcribed, inhibit bacterial 
growth are compared to the known genomic sequence of the pathogenic microorganism being screened or, if the 
homologous sequence from the organism being screened is not known, it may be identified and isolated by 
hybridization to the proliferation-required £ coli sequence interest or by amplification using primers based on the 
proliferation-required £ coli sequence of interest as described above. 

The antisense sequences from the second organism which are identified as described above may then be 
operably linked to a promoter, such as an inducible promoter, and introduced into the second organism. The 
techniques described herein for identifying £. coli genes required for proliferation may thus be employed to determine 
whether the identified sequences from a second organism inhibit the proliferation of the second organism. 

Antisense nucleic acids required for the proliferation of organisms other than £ coli or the genes 
corresponding thereto, may also be hybridized to a microarray containing the £ coli ORFs to gauge the homology 
between the £ coli sequences and the proliferation-required nucleic acids from other organisms. For example, the 
proliferation-required nucleic acid may be from Staphylococcus aureus, Pseudomonas aeruginosa, Enterobacter 
20 cloacae, Helicobacter pylori, Neisseria gonorrhoeae. Enterococcus faecalis. Streptococcus pneumoniae, Haemophilus 

influenzae, Salmonella typhimurium, Saccharomyces cerevisiae, Candida albicans, Cryptococcus neoformans. 
Aspergillus fumigatus. Klebsiella pneumoniae, Salmonella typhi. Salmonella paratyphi, Salmonella cholerasuis. 
Staphylococcus epidermidis, Mycobacterium tuberculosis, Mycobacterium leprae, Treponema pallidum, bacillus 
antnracis. Yersinia pestis, Clostridium botulinum, Campylobacter jejuni or Chlamydia trachomatus. Chlamydia 
pneumoniae or any species falling within the genera of any of the above species. The proliferation-required nucleic 
acids from an organism other than f. coli may be hybridized to the array under a variety of conditions which permit 
hybridization to occur when the probe has different levels of homoloay to the sequence on the microarray. This would 
provide an indication of homology across the organisms as well as clues to other possible essential genes in these 
organisms. 

In still another embodiment, the exogenous nucleic acid sequences of the present invention that are identified as 
required for bacterial growth or proliferation can be used as antisense therapeutics for killing bacteria. The antisense 
sequences can be directed against the proliferation-required genes whose sequence corresponds to the exogenous nucleic 
acid probes identified here (i.e. the antisense nucleic acid may hybridize to the gene or a portion thereof). Alternatively, 
antisense therapeutics can be directed against operons in which proliferation-required genes reside (i.e. the antisense 
nucleic acid may hybridize to any gene in the operon in which the proliferation-required genes reside). Further, antisense 
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therapeutics can beWted against a proliferation-required gene or portion thereof with or without adjacent noncoding 
sequences, an intragenic sequence (i.e. a sequence within a gene), an intergenic sequence (i.e. a sequence between genes), 
a sequence spanning at least a portion of two or more genes, a 5' noncoding region or a 3' noncoding region located 
upstream or downstream from the actual sequence that is required for bacterial proliferation or an operon containing a 
5 proliferation-required gene. 

In addition to therapeutic applications, the present invention encompasses the use of nucleic acid sequences 
complementary to sequences required for proliferation as diagnostic tools. For example, nucleic acid probes complementary 
to proliferation-required sequences that are specific for particular species of microorganisms can be used as probes to 
identify particular microorganism species in clinical specimens. This utility provides a rapid and dependable method by 

10 which to identify the causative agent or agents of a bacterial infection. This utility would provide clinicians the ability to 

prescribe species specific antimicrobial compounds to treat such infections. In an extension of this utility, antibodies 
generated against proteins translated from mRNA transcribed from proliferation-required sequences can also be used to 
screen for specific microorganisms that produce such proteins in a species-specific manner. 

The following examples teach the genes of the present invention and a subset of uses for the £ coli genes 

15 identified as required for proliferation. These examples are illustrative only and are not intended to limit the scope of the 

present invention. 

EXAMPLES . . 

The following examples are directed to the identification and exploitation of £ coli genes required for 
proliferation. Methods of gene identification are discussed as well as a variety of methods to utilize the identified 
20 sequences. 

Genes Identified as Required for Proliferation of £ coli 

Exogenous nucleic acid sequences were cloned into an inducible expression vector and assayed for growth 
inhibition activity. Example 1 describes the examination of a library of exogenous nucleic acid sequences cloned into IPTG- 
25 inducible expression vectors. Upon activation or induction, the expression vectors produced an RNA molecule 

corresponding to the subcloned exogenous nucleic acid sequences. The RNA product was in an antisense orientation with 
respect to the £ coli genes from which it was originally derived. This antisense RNA then interacted with sense mRNA 
produced from various £ coli genes and interfered with or inhibited the translation of the sense messenger RNA (mRNA) 
thus preventing protein production from these sense mRNA molecules. In cases where the sense mRNA encoded a protein 
required for the proliferation, bacterial cells containing an activated expression vector failed to grow or grew at a 
substantially reduced rate. 

EXAMPLE 1 

Inhibition of Bacterial Proliferation after IPTG induction 
To study the effects of transcriptional induction in liquid medium, growth curves were carried out by back 
35 diluting cultures 1:200 into fresh media with or without 1 mM IPTG and measuring the OD 450 every 30 minutes (min). To 
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study the effects of transcriptional induction on solid medium, 10 3 , 10 3 . 10\ 10 5 , 10 6 , 10' and 10 8 fold dilutions of 
overnight cultures were prepared. Aliquots of from 0.5 to 3 pd of these dilutions were spotted on selective agar plates 
with or without 1 mM IPTG. After overnight incubation, the plates were compared to assess the sensitivity of the clones 
to IPTG. 

Of the numerous clones tested, some clones were identified as a containing sequence that inhibited £ coli 
growth after IPTG induction. Accordingly, the gene to which the inserted nucleic acid sequence corresponds, or a gene 
within the operon containing the inserted nucleic acid, may be required for proliferation in £ coli. 

Characterization of Isolated Clones Negatively Affecting £ coli Proliferation 

Following the identification of those expression vectors that, upon expression, negatively impacted £ ^//growth 
or proliferation, the inserts or nucleic acid fragments contained in those expression vectors were isolated for subsequent 
characterization. Expression vectors of interest were subjected to nucleic acid sequence determination. 

EXAMPLE 2 

Nucleic Acid Sequence Determination of Identified Clones FrnrP^ n o Nucleic Arid Fr^ments with flPtrimgntal Effects nf F 
15 coli Proliferation 

The nucleotide sequences for the exogenous identified sequences were determined using plasmid DNA isolated 

using QIAPREP (Qiagen, Valencia. CA) and methods supplied by the manufacturer. The primers used for sequencing the 

inserts were 5' • TGTTTATCAGACCGCTT - 3' (SEQ ID NO: 403) and 5' - ACAATTTCACACAGCCTC • 3' (SEQ ID NO: 404). 

These sequences flank the polylinker in pLEX5BA. Sequence identification numbers (SEQ ID NOs) for the identified inserts 
20 are listed in Table I and discussed below. 

EXAMPLE 3 

Comparison Of Isolated Sequences tn Known Sequences 
The nucleic acid sequences of the subcloned fragments obtained from the expression vectors discussed above 
were compared to known £ coli sequences in GenBank using BLAST version 1.4 or version 2.0.6 using the following 
default parameters: Filtering off, cost to open a gap -5, cost to extend a gap -2, penalty for a mismatch in the blast 
portion of run-3, reward for a match in the blast portion of run- 1, expectation value (e)- 10.0, word size- 11, number of 
one-line descriptions - 100, number of alignments to show |B)-100. BLAST is described in Altschul, J Mol Biol. 215:403- 
10 (1990), the disclosure of which is incorporated herein by reference in its entirety. Expression vectors were found to 
contain nucleic acid sequences in both the sense and antisense orientations. The presence of known genes, open reading 
frames, and ribosome binding sites was determined by comparison to public databases holding genetic information and 
various computer programs such as the Genetics Computer Group programs FRAMES and CODONPREFERENCE. Clones 
were designated as "antisense" if the cloned fragment was oriented to the promoter such that the RNA transcript 
produced was complementary to the expressed mRNA from a chromosomal locus. Clones were designated as "sense" if 
they coded for an RNA fragment that was identical to a portion of a wild type mRNA from a chromosomal locus. 
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The sequerl^Rscribed in Examples 1-2 that inhibited bacterial profit e^^^n and contained gene fragments in 
an antisense orientation are listed in Table I. This tabie lists each identified sequence by: a sequence identification number; 
a Molecule Number; a gene to which the identified sequence corresponds, listed according to the National Center for 
Biotechnology Information (NCBI), Blattner (Science 277:1453-1474(1997); also contains the £ coil K-12 genome 
5 sequence), or Rudd (Micro, and Mol. Rev. 62:985-1019 (1998)), (both papers are hereby incorporated by reference) 

nomenclatures. The CONTIG numbers for each identified sequence is shown, as well as the location of the first and last 
base pairs located on the E coil chromosome. A Molecule Number with a "**" indicates a clone corresponding to an 
intergenic sequence. 

The sequences of the nucleic acid inserts of SEQ ID NOs: 1-81 from U.S. Provisional Patent Application No. 
10 60/1 1 7,405 which inhibited proliferation were further analyzed. The reanalyzed sequences corresponding to SEQ ID NOs. 

1-81 of U.S. Provisional Patent Application No. 60/1 17,405 have SEQ ID NOs. 405-485 in the present application. 

SEQ ID NOs: 82-242 in U.S. Provisional Patent Application No. 60/117,405 are identical to SEQ ID NOs: 82- 
242 of the present application with the following exceptions. SEQ ID NO: 148 in the present application is the 
complementary strand of SEQ ID NO: 148 in U.S. Provisional Patent Application No. 60/117,405. Accordingly, the 
15 protein of SEQ ID NO: 308 which is encoded by SEQ ID NO: 148 has also been revised. SEQ ID NO: 163 in the present 

application is the complementary strand of SEQ ID NO: 163 in U.S. Provisional Patent Application No. 60/117,405. 
Accordingly, the protein of SEQ ID NO: 323 which is encoded by SEQ ID NO: 1 63 has also been revised. 

The target gene of SEQ ID NQs. 18 and 19 of U.S. Provisional Patent Application No. 60/117,405 (SEQ ID 
NOs. 18, 19, 422, 423 of the present application) has been revised from dicF to ftsZ to reflect the fact that these 
20 SEQ ID NOs. include natural antisense molecules which inhibit ftsZ expression. 

The gene products of the nucleic acids of SEQ ID NOs. 198 and 239-242 in U.S. Provisional Patent 
Application No. 60/1 17,405 and in the present application (SEQ ID NOs. 358 and 399-402 of the present application) 
have been revised to reflect the fact that these nucleic acids encode nontranslated tRNAs and rRNAs. Tables I and II 
have been revised accordingly. The SEQ ID NOs. in Table II were also revised to reflect the fact that SEQ ID NOs: 89 
25 and 402 were identical in U.S. Provisional Patent Application No. 60/1 1 7,405. 

TABLE 1 

Identified Clones with Corresponding Genes and Qperons 



SEQ ID 


Molecule No. 


Gene 


Gene 


Gene 


CONTIG 


NO. 




(NCBI) 


(Blattner) 


(Rudd) 




1.405 


EcXAOOl 


yhhQ 


3347! 


yhhQ 


AE000423 


2, 406 


EcXA002 


lepB 


lepB 


lepB 


AE000343 


3. 407 


EcXA003 


f586 


b0955 


ycbZ 


AEO00197 


4, 408 


EcXA004 


rpsG, rpsL 


b3341 


rpsG, rpsL 


AE000410 


5, 409 


EcXA005a 


rpIL, rplJ 


b3986 


rpIL, rplJ 


AEO00472 


6,410 


EcXA005b 


rpIL 


rpIL 


rpIL 


AEO00472 


7,411 


EcXA005c 


rpIL, rplJ 


rpIL, rplJ 


rpIL, rplJ 


AEO00472 


8,412 


EcXA005d 


rpIL, rplJ 


rpIL, rplJ 


rpIL, rplJ 


AEO00472 


9,413 


EcXA005e 


rpIL 


rpIL 


rpIL 


AE000472 
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~ SE^ro 

NO. 


Molecule No. 


Gene 
(NCBI) 


Gene 
(Blattner) 


Gene 
(Rudd) 


CONTIG 


10, 414 


EcXA005 


f rpll 


rpll 


rpll 


AE000472 


11,415 


EcXA005[ 


J rpll 


rpll 


rpll 


AE000472 


12, 41 6 


EcXAOOE 


> Ptc 


> b229) 


? Ptc 


r AE000319 


13, 417 


EcXAOO/ 


yicF 


> b366£ 


y/cf 


' AE000444 


I4 t 41 8 


EcXA008c 


yhaL 


9 b312/ 


yhaL 


' AE000394 


15,419 


EcXA008b 


yhaL 


f yhaL 


' yhaL 


f AE000394 


16, 420 


EcXA008c 


yhab 


yhab 


' yhaL 


' AE000394 


17, 421 


EcXA009 


ydeY 


ydeY 


9 ydey 


' AE000249 


18, 422 


EcXAOlOa 
{natural as] 


dicF 


b1575 


dicF 

■ 


AE000253 


19, 423 


EcXAOlOb 


dicF 


dicF 


dicF 


AE000253 


20, 424 


EcXAOH 


fdnG 


b1474 


IdnG 


AE000244 


21, 425 


EcXA012a 


fusA 


b3340 


lusA 


AE000410 


22, 426 


EcXA012b 


fusA 


lusA 


lusA 


AE000410 


23, 427 


EcXA012c 


fusA 


lusA 


fusA 


AE000410 


24, 428 


EcXA013a 


I 086 


b2562 


ylhL 


AE000342 


25, 429 


EcXA013b 


086 


b2562 


yfhL 


AE000342 


26, 430 


EcXA013c 


086 


b2562 


ylhL 


AE000342 


27, 431 


EcXA014 


visC 


b2906 


visC 


AE000374 


28, 432 


EcXA015 


yfdl 


yfdl 


yfdl 


AE000323 


29, 433 


EcXA016 


yeaQ 


yeaO 


yeaQ 


AE000274 






yoaG 


yoaG 


yoaG 




30, 434 


EcXA017a 


yggE 


b2922 


yggE 


AE000375 


31, 435 


EcXA017b 


yggE 


yggE 


yggE 


AE000375 


32, 436 


EcXA018a 


0464 


b2074 


yegM 


AE000297 


33, 437 


: ^EcXAOISb 


0464 


b2074 


yegM 


AE000297 


ox /ion 

34, 438 


EcXA019a 


yehA 


yehA 


yehA 


AE000300 












AE000299 


35, 439 


EcXA019b 


of 72, yehA 


o172, yehA 


0172, yehA 


AE000299 


36, 440 


EcXA020 


o384 J 82 


b 1 794, b 1795 


yeaP, yeaQ 


AE000274 


37, 441 


EcXA021a 


1112 


b0218 


yalU 


AE000130 


38, 442 


EcXA021b 


1112 


b0218 


yalU 


AE000130 


39, 443 


EcXA022 


o740 


b1629 


ydgN 


AE000258 


40, 444 


EcXA023a 


f176, 1382 


b!504, b1505 


ydeS, ydeT 


AE000247 


41. 445 


EcXA023b 


1176, 1382 


b1504, b1505 


ydeS, ydeT 


AE000247 


42, 446 


EcXA024 


YffM YffiN 


b3082 


ygM ygiN 


AE000390 


43, 447 


EcXA025 


02383 


b1878 


yeeJ 


AE000289 


44, 448 


EcXA026 


oBl 


Unpre-dicted 


Unpre-dicted 


AE000138 


45, 449 


EcXA027a 


yohH 


yohH 


yohH 


AE000303 


46, 450 


EcXA027b 


yohH 


yohH 


yohH 


AE000303 


47, 451 


EcXA027c 


yohH 


yohH 


yohH 


AE000303 






yohl 


yohl 


yohl 




48, 452 


EcXA027d 


yohH 


yohH 


yohH 


AE000303 


49, 453 


EcXA028 


1296 


b2305 


yfcl 


AE000319 


50, 454 


EcXA029 


YiiK 


b4391 


yiiK 


AE000509 


51,455 


EcXA030 


yi5A 


D3557 


yiSA 


AE000433 


52,456 


EcXA031 


rplE 


83308 


rplE 


AEQ00408 


53,457 


EcXA032a 


YbgD 


ybgD 


ybgD 


AE000175 


54, 458 


EcXA032b" 


ybgD 


YbgD\ 


ybgD 


AE000175 
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seq 


Molecule No. 


Gene 


Gene 


Gene 


CQNTIG 


luU. 




(NCBI) 


(Blattner) 


(Rudd) 








gltA 


gltA 


git A 




55, 459 


EcXA033a 


f 4 77 (as) 


1)3052 


waaE 


AE000387 












AE000386 


DO, 4bU 


cCaAUojd 


f477 


b3052 


waaE 


AE000387 


C"7 /I C 1 

b/ f 4bl 


EcXA034a 


cspA 


b3556 


cspA 


AE000433 


CO yien 

bo, 462 


EcXA034b 


cspA 


b3556 


cspA 


AE000433 


CD /! CO 

by, 4oJ 


EcXA035 


yhjU 


Y hjU 


yhfU 


AE000431 


en AC A 

60, 464 


EcXA036 


YQjF 


b3101 


yg/F 


AE000392 






o99 


b3W0, 


VQjK 




CI /ICC 

b 1, 4bb 


rr — v a n d "7 

EcXA037 


ydeH 


b!535 


ydeH 


AE000251 


62, 466 


EcXA038 


sieB 


b!353 


sieB 


AE000233 


63, 467 


EcXA039 


ybbD 




ybbD 


AE000156 


64, 468 


EcXA040 


InsB 6 


b3445 


insB 6 


AE000420 


65, 469 


EcXA041 


f234 


b1138 


ymfE 


AE000214 


cc /nn 

bb, 470 


EcXA042a 


rplY 


rplY 


rp/Y 


AE000308 


67, 471 


EcXA042b 


rp/Y 


rplY 


rplY 


AE000308 


CO vl *70 

68, 472 


EcXA043 


ybgB 


ybgB 


ybgB 


AE000176 






cydA 


cydA 


cydA 




69, 473 


EcXA044 


purB 


b1131 


purB 


AE000213 


/u, 4/4 


EcXA045 


csrA 


csrA 


csrA 


AE000353 






serV 


serV 


serV 




71, 475 


EcXA046 


fimE, fimA 


b4313 


fimE, fimA 


AE000502 


72, 476 


EcXA047** 


f96, cspB 


f96, cspB 


cspB, ydfS 


AE000252 


70 >1 "7*7 

73, 477 


EcXA048 


yefE 


yefE 


yefE 


AE000294 


"7 ./■ /no 

74, 478 


, EcXA049 


. yaiC 


b0385 


yaiC 


AE000145 


75, 479 


EcXA050 


o467, o222 


yaiU,yaiV 


yaiU, yaiV 


AE000144 


76, 480 


EcXA051a 


rplB, rplW 


rplB, rptW 


rplB r rplW 


AE000408 


77, 481 


EcXA051b 


rptW 


rplW 


rplW 


AE000408 


78, 482 


EcXA052 


infC 


infC 


infC 


AE000267 












AE000266 


79, 483 


EcXA053 


gor 


gor 


gor 


AE000426 


80, 484 


EcXA054 


rplF 


rplF 


rplF 


AE000408 


81,485 


EcXA055 


rrIG 


rrIG 


rrIG 


AE000345 



EXAMPLE 4 

Identificatio n of Genes and their Corresponding Qperons Affected by Antisense Inhibition 
The sequencing of the entire E. coli genome is described in Blattner et al., Science 277:1453-1474(1997) the 
entirety of which is hereby incorporated by reference and the sequence of the genome is listed in GenBank Accession 
No.U00096, the disclosure of which is incorporated herein by reference in its entirety. The operons to which the 
proliferation-inhibiting nucleic acids correspond were identified using RegulonDB and information in the literature. The 
coordinates of the boundaries of these operons on the E. coli genome are listed in Table III. Table II lists the molecule 
numbers of the inserts containing the growth inhibiting nucleic acid fragments, the genes in the operons corresponding to 
the inserts, the SEQ ID NOs of the genes containing the inserts, the SEQ ID NOs of the proteins encoded by the genes, the 
start and stop points of the genes on the E. coli genome, the orientation of the genes on the genome, whether the operons 
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are predicted or documented, and the predicted functions of the genes. The identified operons, their putative functions, 
and whether or not the genes are presently thought to be required for proliferation are discussed below. 

Functions for the identified genes were determined by using either Blattner functional class designations or by 
comparing identified sequence with known sequences in various databases. A variety of biological functions were noted 
for the genes to which the clones of the present invention correspond. The functions for the genes of interest appear in 
Table II. 

The proteins that are listed in Table II are involved in a wide range of biological functions. 
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Predicted functional class 
of encoded proteins 








Probable integral membrane 
protein Phthalate permease 
familv 




Putative ABC transoorter 










Regulator of cell division 


Anaerobic respiration 
(formate dehvdrooenase) 


■3 






No homologues, no motifs 


Ubiquinone synthesis 


Blattner functional class 
of encoded proteins 


Hypothetical ORF, 
unclassified, unknown 


Putative enzymes 


Hypothetical ORF, 
unclassified, unknown 


Carbon compound 
catabolism 


Putative transport proteins 


Putative transport proteins 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Cell processes (incl. 
Adaptation, protection) 


Energy metabolism 


Energy metabolism 


Energy metabolism 




Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Predicted (P) 
Or 

Documented 
(D) 
Operon 


a. 








CU 












p.. 










CU. 




Right 
Coordinate 


3269492 


3270407 


3271198 


3272548 


1601049 


1602071 


1603063 


1604097 

■ 


1604999 


1605313 


CO 
LO 

CO 
CD 


1548472 


1549369 


1550015 




2697943 


3050337 


Left Coordinate 


3268266 


3269508 


3270428 


3271214 


1599514 


1601043 


1602071 


1603075 
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1605023 


105305 


1545425 


1548485 
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t 
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•2. 


visC 


Mole. No. 
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CD 
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X 
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LU 


EcXA014 
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No. 


LO 

CO 


CO 
CD 


r*. 
cn 


OO 
CO 


CO 
CO 


CO 

CD 


Ed 


CNJ 
CD 


OO 
CD 
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Predicted functional class 
of encoded proteins 










Homologues in H. Inf. and S. 
Pombe., no motifs, 
transmem brane region 
oresenf 


- 
















f imflike 


Blattner functional class 
of encoded proteins 


Hypotlielical ORF, 
unclassified, unknown 


Putative chaperones 


Cell structure 




Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical 0RF f 
unclassified, unknown 


Hypothetical 0RF r 
unclassified, unknown 


Transcription, RNA 
processing and degradation 


Hypothetical ORF, 
unclassified, unknown 


hypothetical ORF, 1 
^classified, unknown 


Predicted (P) 
Or 

Documented 
(D) 
Operon 










CL. 




















Right 
Coordinate 


2188930 


.2189665 


2190242 


239084 


1704372 


1704950 


1707165 


1708224 


1708848 


1709547 


1710182 


1586320 


1586863 


Left Coordinate 


2186450 


2188946 


1 2189700 


238746 


1703791 


1704372 


1704943 


1707166 


1708228 


1708852 


1709547 


1585817 


1586333 


Genes On 
Operon 


t 


to 




Same operon 
as EcXA016 
(one of the 
two) 
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i 
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<>> 

t 
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CD 
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u 
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Predicted functional class 
of encoded proteins 


fimdlike 


Weak homology to long 
chain fatty acid coa ligasein 
Archaeolobus M 


Homologues in various ^ 
bacteria 


Strong similarity to 
numerous attaching amd 
effacing proteins and 
invasins 


nifm like 




Xylose binding protein like 




IS 

i 

"Z o 

co ca 
r= to 

.1 § 

co i= 


Similar to ABC transporter L 


O 
CD 
LO 

CO 










Blattner functional class 
of encoded proteins 


Structural proteins 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 




Putative transport proteins 


Hypolhetical ORF, 
unclassified, unknown 


Putative regulatory protein 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Phage, transposon, or 
plasmid 


Translation, post- 
Iranslational modification 


Putative transport proteins 


Translation, post- 
Iranslational modification 


Predicted (P) 
Or 

Documented 
(0) 
Operon 










unpredicted 


cu 






a. 


o_ 


Q_ 




o 






Right 
Coordinate 


1588025 


3231785 


3232096 


2050036 


331184 


2226539 


2226859 


2228405 


2421559 


4628091 


3718830 


3719678 


3440371 


3441734 


3442176 


Left Coordinate 


1586877 


3231369 


3231782 


2042885 


331001 


2225343 


2226569 


2227458 


2420669 


4626424 


3718309 


3718827 


3440255 


3440403 


3441742 


Genes On 
Operon 


t 
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t 


$ 


yohl 








CQ 

to 




prlA 


rpIO 


Mole. No. 
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CD 

<x 
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u 
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CD 
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CJ 
UJ 


-C3 
CO 

r-* 

CNI 

CD 
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X 
UJ 






CJO 
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CD 
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u 
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cn 
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CD 
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UJ 
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Predicted functional class 
of encoded proteins 








Translation 






Translation 






Hypothetical fimbrial protein 

4 


€/ 

c7 
a 
j= 
c 
> 

jE 
a, 
cz 

E 

re 
= 


ADP heptose synthase/ 
autotrnnhin nrnwth nrntpin 


i 

L 

i 

b 
- 




RNA chaperonin 




Blattner functional class 
of encoded proteins 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Cell processes (incl. 
Adaptation, protection) 


Energy metabolism 


Putative enzymes 


Translation, post- 

f ranslational modification 


Hypothetical 0RF, 
unclassified, unknown 


Cell processes (incl. 1 
Adaptation, protection) 


Translation, post- 
ranslational modification 


Predicted (P) 
Or 

Documented 
(D) 
Operon 
































Q_ 
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Coordinate 


3442359 


3442866 


3443234 


3443777 


3444182 


3444521 


3445075 


rr 

CO 

rr 

LO 

rr 
rr 

CO 


• 3445786 


752018 


753691 


3194394 


3197282 


3198606 


3717890 


3695658 


Left Coordinate 


3442180 


3442363 


3442881 


3443244 


3443790 


3444216 


3444536 


3445090 


3445415 


751452 


752408 


3192961 


3194442 


3197305 


3717678 


3694087 


Genes On 
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CJ 
UJ 
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Predicted functional class 
of encoded proteins 


















Lysis protein 
















UDP galactopyranase 
iwtacp 


Jnknown 


a> 

o 
a. 

in 
CZ 

ro 
o 

=3 

ro 

O) 

# > 
ro 

*5 


Blattner functional class 
of encoded proteins 


Cell-structure 


i-en structure 
ICell structure 


Cell structure 


Cell structure 


Cell structure 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Hypothetical ORF, 
unclassified, unknown 


Cell processes (incl. 
Adaptation, protection) 


Phage, transposon, or 
plasmid 


Putative enzymes 


Hypothetical ORF, 
unclassified, unknown 


Putative enzymes 


Hypothetical ORF, 
unclassified, unknown 


Cell structure 


Hypothetical ORF, . I 
jnclassified. unknown i 


hypothetical ORF, 1 
jnclassified, unknown 


5 utative enzymes 1 


Predicted (P) 
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Documented 
(D) 
Operon 




































Q, 


Cw 


Right 
Coordinate 


4541835 


4545301 


! 4545841 


4546357 


4547279 


1638684 


1638081 


1638389 


1638684 


1639578 


2100933 


2101411 


2102531 


2103106 


2104079 


2105248 


2106351 


404042 


393642 


Left Coordinate 


4541188 


4542B65 


4545311 


4545854 


454637:7 


1637054 


1637548 


1638078 


1638394 


1639363 


2099917 


2100938 


2101413 


2102516 


2103087 


2104082 


2105248 


402927 


392239 
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Predicted functional class 
of encoded proteins 


Hypothetical outer 
memhrane nrntpin 


U 












Translation 


Translation 












Translation 


Blattner functional class 
of encoded proteins 


Hypothetical ORF, 
unclassified, unknown 


Translation, post- 
translational modification 


Translation, post* 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
Iranslational modification 


rranslation, post- 
iranslational modification 


Predicted (P) 
| Or 
Documented 
(D) 
Operon 




Q 






















CD 






Right 
Coordinate 


394353 


3446205 


3446396 


3446806 


. 3447520 


3447870 


3448163 


3449001 


3449321 


3449923 


3450563 


3450907 


1797773 


1798023 


1798662 


Left Coordinate 


393685 


3445951 


3446205 


3446396 


3446819 


3447538 


3447885 


3448180 


3449019 


3449318 


3449934 


3450596 


1797417 


1797826 


1798120 
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Predicted functional class 
of encoded proteins 




Glutathione oxido-reductase 




! Translation (rRNA) 


Translation (rRNA) 


Translation (tRNA) 


Translation (rRNA) 


Biattner functional class 
of encoded proteins 


Translation, post- 
translational modification 


Biosynthesis of cofactors, 
prosthetic groups and 
carriers 




Translation, post* 
translational modification 


Translation, post- 
translational modification 


Translation, post- 
translational modification 


Translation, post : 
translational modification 


Predicted (P) 
Or 

Documented 
(D) 
Operon 








CD 








Right 
Coordinate 


1800594 


3645281 


2727204 


2724208 


2727464 


27291.78 


Left Coordinate 
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2724301 


2724089 


2727389 


2727636 
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Several of the expression vectors contain fragments that correspond to genes of unknown function or if the function 
is known, it is not known whether the gene is essential. For example, EcXAODl, 003, 007, 008, 013, 015, 016, 017, 018, 
019. 020, 021, 022, 023, 024, 025, 026, 027. 028. 029. 030. 032. 033, 034, 035, 036, 037, 038, 039, 040. 041, 047, 
048, 049 and 050 are ail exogenous nucleic acid sequences that correspond to £ coli proteins that have no known function or 
5 where the function has not been shown to be essential or nonessential. 

The present invention reports a number of novel £ coli genes and operons that are required for proliferation. From the 
list clone sequences identified here, each was identified to be a portion of a gene in an operon required for the proliferation of £ 
coli. Cloned sequences corresponding to genes already known to be required for proliferation in £ coli include EcXA002, 004, 
005, 010, 012, 014, 031, 02, 043, 045. 051. 052, 054. and 055. The remaining identified sequences correspond to£. coli 
1 0 genes previously undesignated as required for proliferation in the art. 

An interesting observation of the present invention is that there are also several sequence fragments that correspond 
to £ coli genes that are not thought to be required for £ ro// proliferation. Nevertheless, under the conditions described above, 
the antisense expression of these gene fragments causes a reduction in cell growth. This result implies that the genes 
corresponding to the identified sequences are actually required for proliferation. Molecule Nos. corresponding to these genes 
15 are EcXA006, 044, 046, and 053. 

Following identification of the sequences of interest, these sequences were localized into operons. Since bacterial 
genes are expressed in a polycistronic manner, the antisense inhibition of a single gene in an operon might effect the expression 
of all the other genes on the operon or the genes down stream from the single gene identified. In order to determine which of 
the gene products in an operon are required for proliferation, each of the genes contained within an operon may be analyzed for 
20 their effect on viability as described below. 
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TABLE 111 



Qperon Boundaries 



Mole. No. 


Left 
Coordinate 


Right 
Coordinate 


EcXAOOl 


3606848 


3608143 


EcXA002 


2702355 


2703329 


EcXA003 


1015762 


1017522 


EcXA004 


3467782 


3472189 


EcXA005 


4177574 


4178503 


EcXA006 


2412767 


2414911 


EcXA007 


3841591 


3843357 


EcXA008 


3268266 


3272548 


EcXA009 


1599514 


1605313 


EcXAOlO 


1647406 


1647458 


EcXAOn 


1545425 


1550015 


EcXA012 


3467782 


3472189 


EcXA013 


2697683 


2697943 


EcXA014 


3049135 


3053470 


EcXAO-1 5 


2465875 


2468482 


EcXA016 


1877031 


1877972 


EcXA017 


3065360 


3066100 


EcXA018 


2151891 


2160901 


EcXA019 


2185400 


2190242 


EcXA020 


1877031 


1877972 


:: - EcXA021 


238746 


239084 


EcXA022 


1703791 


1710182 


EcXA023 


1585817 


1588025 


EcXA024 


3231369 


3232096 


EcXA025 


2042885 


2050036 


EcXA026 


331001 


331184 


EcXA027c 


2225343 


2228405 


EeXA028 


2420669 


2421559 


EcXA029 


4626424 


4628091 


EcXA030 


3718309 


3719678 


EcXA031 


3440255 


3445786 


EcXA032b 


751452 


753691 


EcXA033 


3192961 


3198606 


EcXA034 


3717678 


3717890 


EcXA035 


3694087 


3697522 


EcXA036 


3246594 


3248594 


EcXA037 


1620984 


1621874 


EcXA038 


1416572 


1417368 


EcXA039 


522485 


52B354 


EcXA040 


3580669 


3580672 


EcXA041 


1196090 


1197460 


EcXA042 


2280537 


2280821 
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15 



20 



Mole. No. 


Left 


Right 




Coordinate 


Coordinate 


Y AO/1 *3 


765207 


773404 


CCAAU44 


1 189839 


1191209 


c r v a n/i c 


2816575 


2817168 


CCaAU^O 


>i r o n f n r 

4538525 


4547279 


r.V A HA "7 

cCaAUh/ 


1637054 


1639578 


tCAAU4o 


2099917 


2106351 


C/* V A ClA O 

tCAAU4y 


402927 


404042 


tCAAUSD 


392239 


394353 


EcXA051 




J45U907 


EcXA052 


1797417 


1800594 


EcXA053 


3643929 


3645281 


EcXA054 


3440255 


3445786 


EcXA055 


2724301 


2729178 



EXAMPLE 5 

Identification of Individual G enes within an Qperon Required for Proliferation 
The following example illustrates a method for determining which gene in an operon is required for proliferation. 
The clone insert corresponding to Molecule No. EcXA004 possesses nucleic acid sequence homology to the£ coff genes 
rspG and rspC This molecule corresponds to an operon containing two additional genes fusA and tufA. The rpsL gene is 
the first gene in the operon. To determine which gene or genes in this operon are required for proliferation, each gene is 
selectively inactivated using homologous recombination. Gene rpsL is the first gene to be inactivated. 

Deletion inactivation of a chromosomal copy of a gene in £ coff can be accomplished by integrative gene 
replacement. The principle of this method (Hamilton, C. M., et al 1989. J. Bacteria/. 171: 4617-4622) is to construct a 
mutant allele of the targeted gene, introduce that allele into the chromosome using a conditional suicide vector/and then 
force the removal of the native wild type allele and vector sequences. This will replace the native gene with a desired 
mutation(s) but leave promoters, operators, etc. intact. Essentiality of a gene is determined either by deduction from 
genetic analysis or by conditional expression of a wild type copy of the targeted gene (trans complementation). 

The first step is to generate a mutant rpsL allele using PCR amplification. Two sets of PCR primers are chosen 
to produce a copy of rpsL with a large central deletion to inactivate the gene. In order to eliminate polar effects, it is 
desirable to construct a mutant allele comprising an in-frame deletion of most or all of the coding region of the rpsl gene. 
Each set of PCR primers is chosen such that a region flanking the gene to be amplified is sufficiently long to allow 
recombination (typically at least 500 nucleotides on each side of the deletion). The targeted deletion or mutation will be 
contained within this fragment. To facilitate cloning of the PCR product, the PCR primers may also contain restriction 
endonuclease sites found in the cloning region of a conditional knockout vector such as pK03 (Link, et al 1997 J. 
Bacterial. 179 (20): 6228-6237). Suitable sites include Notl, Sail. BamHI and Smal. The rpsL gene fragments are 
produced using standard PCR conditions including, but not limited to. those outlined in the manufacturers directions for the 
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Hoi Start Taq PCR kit (Qiagen, Inc.. Valencia, CA). The PCR reactions will produce two fragments that can be fused 
together. Alternatively, crossover PCR can be used to generate a desired deletion in one step (Ho, S. N., et al 1989. Gene 
77: 51-59, Horton. R. M., et al 1989. Gene 77: 61-68). The mutant allele thus produced is called a "null" allele because it 
cannot produce a functional gene product. 
5 The mutant allele obtained from PCR amplification is cloned into the multiple cloning site of pK03. Directional 

cloning of the rpsL null allele is not necessary. The pK03 vector has a temperature-sensitive origin of replication derived 
from pSCIOI. Therefore, clones are propagated at the permissive temperature of 30°C. The vector also contains two 
selectable marker genes: one that confers resistance to chloramphenicol and another, the Bacillus subTilis sacB gene, that 
allows for counter-selection on sucrose containing growth medium. Clones that contain vector DNA with the null allele 
10 inserted are confirmed by restriction endonuclease analysis and DNA sequence analysis of isolated plasmid DNA. The 

plasmid containing the rpsL null allele insert is known as a knockout plasmid. 

Once the knockout plasmid has been constructed and its sequence verified, it is transformed into a Rec* £ coli 
host cell. Transformation can be by any standard method such as electroporation. In some fraction of the transformed 
cells, plasmids will integrate into the £ coli chromosome by homologous recombination between xhe rpsL null allele in the 
15 plasmid and the rpsL gene in the chromosome. Transformant colonies in which such an event has occurred are readily 

selected by growth at the non-permissive temperature of 43°C and in the presence of choramphenicol. At this 
temperature, the plasmid will not replicate as an episome and will be lost from cells as they grow and divide. These cells 
are no longer resistant to chloramphenicol and will not grow when it is present. However, cells in which the knockout 
plasmid has integrated into the £ coli chromosome remain resistant to chloramphenicol and propagate. 
20 Cells con 'a'ning integrated knock out plasmids are usually the result of a single crossover event that creates a 

tandem repeat of the mutant and native wild type alleles of rpsL separated by the vector sequences. A consequence of 
this is that rpsL will still be expressed in these cells. In order to determine if the gene is essential for growth, the wild type 
copy must be removed. This is accomplished by selecting for plasmid excision, a process in which homologous 
recombination between the two alleles results in looping out of the plasmid sequences. Cells that have undergone such an 
25 excision event and have lost plasmid sequences including sacB gene are selected for by addition of sucrose to the medium. 

The sacB gene product converts sucrose to a toxic molecule. Thus counter selection with sucrose ensures that plasmid 
sequences are no longer present in the cell. Loss of plasmid sequences is further confirmed by testing for sensitivity to 
chloramphenicol (loss of the chloramphenicol resistance gene). The latter test is important because occasionally a 
mutation in the sacB gene can occur resulting in a loss of sacB function with no effect on plasmid replication (Link, et. al.. 
30 1997 J. Bacterial. 179 (20): 6228-6237). These artifact clones retain plasmid sequences and are therefore still resistant 

to chloramphenicol. 

In the process of plasmid excision, one of the two rpsL alleles is lost from the chromosome along with the 
plasmid DNA. In general, it is equally likely that the null allele or the wild type allele will be lost. Therefore, if the rpsL 
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gene is not.essential, half of the clones obtained in this experiment will have the wild type allele on the chromosome and 
half will have the null allele. However, if the rpsL gene is essential, cells containing the null allele will not be obtained as a 
single copy of the null allele would be lethal. 

To determine the essentiality of rpsL, a statistically significant number of the resulting clones, at least 20, are 
5 analyzed by PCR amplification of the rpsL gene. Since the null allele is missing a significant portion of the rpsL gene, its 

PCR product is significantly shorter than that of the wild type gene and the two are readily distinguished by gel 
electrophoretic analysis. The PCR products may also be subjected to sequence determination for further confirmation by 
methods well known to those in the art. 

The above experiment is generally adequate for determining the essentiality of a gene such as rpsL. However, it 
1 0 may be necessary or desirable to more directly confirm the essentiality of the gene. There are several methods by which 

this can be accomplished. In general, these involve three steps: 1) construction of an episome containing a wild type 
allele, 2) isolation of clones containing a single chromosomal copy of the mutant null allele as described above but in the 
presence of the episomai wild type allele, and then 3) determining if the cells survive when the expression of the episomal 
allele is shut off. In this case, the trans copy of wild type rpsL is made by PCR cloning of the entire coding region of rpsL 
15 and inserting it in the sense orientation downstream of an inducible promoter such as the £ coli lac promoter. 

Transcription of this allele of rpsL will be induced in the presence of IPTG which inactivates the lac repressor. Under IPTB 
induction rpsL protein will be expressed as long as the recombinant gene also possesses a ribosomal binding site, also 
known as a "Shine-Dalgarno Sequence". The trans copy of rpsL is cloned on a plasmid that is compatible with pSC101. 
Compatible vectors include pl5A, pBR322. and the pUC plasmids, among others. Replication of the compatible plasmid 
20 will not be temperature-sensitive. The entire process of integrating the null allele of rpsL and subsequent plasmid excision 

is carried out in the presence of IPTG to ensure the expression of functional rpsL protein is maintained throughout. After 
the null rpsL allele is confirmed as integrated on the chromosome in place of the wild type rpsL allele, then IPTG is 
withdrawn and expression of functional rpsL protein shut off. If the rpsL gene is essential, cells will cease to proliferate 
under these conditions. However, if the rpsL gene is not essential, cells will continue to proliferate under these conditions. 
25 In this experiment, essentiality is determined by conditional expression of a wild type copy of the gene rather than inability 

to obtain the intended chromosomal disruption. 

An advantage of this method over some other gene disruption techniques is that the targeted gene can be deleted 
or mutated without the introduction of large segments of foreign ONA. Therefore, polar effects on downstream genes are 
eliminated or minimized. There are methods described to introduce inducible promoters upstream of potential essential 
30 bacterial genes. However in such cases, polarity from multiple transcription start points can be a problem. One way of 

preventing this is to insert a gene disruption cassette that contains strong transcriptional terminators upstream of the 
integrated inducible promoter (Zhang, Y, and Cronan, J. E. 1996 J. Bacteriol. 178 (12): 3614-3620). The described 
techniques will all be familiar to one of ordinary skill in the art. 
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Following the analysis of the rpsL gene, the other genes of the operon are investigated to determine if they are 
required for proliferation. 

EXAMPLE 6 

Expression of the Proteins Encoded by Genes Identified as Required for £ coli Proliferation 
The following is provided as one exemplary method to express the proliferation-required proteins encoded by the 
identified sequences described above. First, the initiation and termination codons for the gene are identified. If desired, 
methods for improving translation or expression of the protein are well known in the art. For example, if the nucleic acid 
encoding the polypeptide to be expressed lacks a methionine codon to serve as the initiation site, a strong Shine-Delgarno 
sequence, or a stop codon, these sequences can be added. Similarly, if the identified nucleic acid sequence lacks a transcription 
termination signal, this sequence can be added to the construct by, for example, splicing out such a sequence from an 
appropriate donor sequence. In addition, the coding sequence may be operably linked to a strong promoter or an inducible 
promoter if desired. The identified nucleic acid sequence or portion thereof encoding the polypeptide to be expressed is obtained 
by PCR from the bacterial expression vector or genome using oligonucleotide primers complementary to the identified nucleic 
acid sequence or portion thereof and containing restriction endonuclease sequences for A/col incorporated into the 5' primer and 
1 5 Bg/H at the 5' end of the corresponding 3'-primer, taking care to ensure that the identified nucleic acid sequence is positioned in 

frame with the termination signal. The purified fragment obtained from the resulting PCR reaction is digested with Nco\ and 
Bg/ft, purified and ligated to an expression vector. 

The ligated product is transformed into DH5a or some other £ coli strain suitable for the over expression of potential 
proteins. Transformation protocols are well known in the art. For example, transformation protocols are described in: Current 
20 Protocols in Molecular Biology, Vol. 1. Unit . 1.8, (Ausubel, et a!., Eds.) John Wiley & Sons, Inc. (1997). Positive 

transformants are selected after growing the transformed cells on plates containing 50-100 /yg/ml Ampicillin (Sigma, St. Louis, 
Missouri). In one embodiment, the expressed protein is held in the cytoplasm of the host organism. In an alternate embodiment, 
the expressed protein is released into the culture medium. In still another alternative, the expressed protein can be sequestered 
in the periplasmic space and liberated therefrom using any one of a number of cell lysis techniques known in the art. For 
25 example, the osmotic shock cell lysis method described in Chapter 16 of Current Protocols in Molecular Biology, Vol. 2, 

(Ausubel, et al., Eds.) John Wiley & Sons, Inc. (1997). Each of these procedures can be used to express a proliferation-required 
protein. 

Expressed proteins, whether in the culture medium or liberated from the periplasmic space or the cytoplasm, are then 
purified or enriched from the supernatant using conventional techniques such as ammonium sulfate precipitation, standard 
30 chromatography, immunoprecipitation, immunochromatography, size exclusion chromatography, ion exchange chromatography, 

and HPLC. Alternatively, the secreted protein can be in a sufficiently enriched or pure state in the supernatant or growth media 
of the host to permit it to be used for its intended purpose without further enrichment. The purity of the protein product 
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obtained can be assessed using techniques such as Coomassie or silver staining or using antibodies against the control protein. 
Coomassie and silver staining techniques are familiar to those skilled in the an. 

Antibodies capable of specifically recognizing the protein of interest can be generated using synthetic peptides using 
methods well known in the art. See, Antibodies: A Laboratory Manual, (Harlow and Lane, Eds.) Cold Spring Harbor 
5 Laboratory (1988). For example, 15-mer peptides having a sequence encoded by the appropriate identified gene sequence of 

interest or portion thereof can be chemically synthesized. The synthetic peptides are injected into mice to generate antibodies 
to the polypeptide encoded by the identified nucleic acid sequence of interest or portion thereof. Alternatively, samples of the 
protein expressed from the expression vectors discussed above can be purified and subjected to amino acid sequencing analysis 
to confirm the identity of the recombinantly expressed protein and subsequently used to raise antibodies. An Example 

1 0 describing in detail the generation of monoclonal and polyclonal antibodies appears in Example 7. 

The protein encoded by the identified nucleic acid sequence of interest or portion thereof can be purified using 
standard immunochromatography techniques. In such procedures, a solution containing the secreted protein, such as the 
culture medium or a cell extract, is applied to a column having antibodies against the secreted protein attached to the 
chromatography matrix. The secreted protein is allowed to bind the immunochromatography column. Thereafter, the column is 

15 washed to remove non-specifically bound proteins. The specifically bound secreted protein is then released from the column 

and recovered using standard techniques. These procedures are well known in the art. 

In an alternative protein purification scheme, the identified nucleic acid sequence of interest or portion thereof can be 
incorporated into expression vectors designed for use in purification schemes employing chimeric polypeptides. In such 
strategies the coding sequence of the identified nucleic acid sequence of interest or portion thereof is inserted in-frame with the 

20 gene encoding the other half of the chimera. The other half of the chimera can be maltose binding protein (MBP) or a nickel 

binding polypeptide encoding sequence. A chromatography matrix having antibody to MBP or nickel attached thereto is then 
used to purify the chimeric protein. Protease cleavage sites can be engineered between the MBP gene or the nickel binding 
polypeptide and the identified expected gene of interest, or portion thereof. Thus, the two polypeptides of the chimera can be 
separated from one another by protease digestion. 

25 ^ One useful expression vector for generating maltose binding protein fusion proteins is pMAL (New England Biolabs), 

which encodes the malE gene. In the pMal protein fusion system, the cloned gene is inserted into a pMal vector downstream 
from the malE gene. This results in the expression of an MBP-fusion protein. The fusion protein is purified by affinity 
chromatography. These techniques as described are well known to those skilled in the art of molecular biology. 

EXAMPLE 7 

30 Production of an Antibody to an isolated £ cofi Protein 

Substantially pure protein or polypeptide is isolated from the transformed cells as described in Example 6. The 
concentration of protein in the final preparation is adjusted, for example, by concentration on a 10,000 molecular weight cut off 



-44- 



BNSDOCID: <WO 0044&06A2J_> 



WO 00/44906 . pct/usoO/02200 



AMICON filter device (Millipore. Bedford. MA), to the level of a few micrograms/ml. Monoclonal or polyclonal antibody to the 
protein can then be prepared as follows: 

Monoclonal Antibody Production by Hybridoma Fusion 

Monoclonal antibody to epitopes of any of the peptides identified and isolated as described can be prepared from 
5 murine hybridomas according to the classical method of Kohler, G. and Milstein, C, Nature 256:495 (1 975) or any of the well- 

known derivative methods thereof. Briefly, a mouse is repetitively inoculated with a few micrograms of the selected protein or 
peptides derived therefrom over a period of a few weeks. The mouse is then sacrificed, and the antibody producing cells of the 
spleen isolated. The spleen cells are fused by means of polyethylene glycol with mouse myeloma cells, and the excess unfused 
cells destroyed by growth of the system on selective media comprising aminopterin (HAT media). The successfully fused cells 
1 0 are diluted and aliquots of the dilution placed in wells of a microliter plate where growth of the culture is continued. Antibody- 

producing clones are identified by detection of antibody in the supernatant fluid of the wells by immunoassay procedures, such 
as ELISA, as described by Engvall, E., "Enzyme immunoassay ELISA and EMIT," Meth. Enzymol. 70:419 (1980), and 
derivative methods thereof. Selected positive clones can be expanded and their monoclonal antibody product harvested for use. 
Detailed procedures for monoclonal antibody production are described in Davis, L. et al. Basic Methods in Molecular Biology 
15 Elsevier, New York. Section 21-2. 

Polyclonal Antibody Production by Immunization 

Polyclonal antiserum containing antibodies to heterogeneous epitopes of a single protein or a peptide can be prepared 
by immunizing suitable animals with the expressed protein or peptides derived therefrom described above, which can be 
unmodified or modified to enhance immunogenicity. Effective polyclonal antibody production is affected by many factors 
20 related both to the antigen and the host species. For example, small molecules tend to be less immunogenic than larger 

molecules and can require the use of carriers and adjuvant. Also, host animals vary in response to site of inoculations and dose, 
with both inadequate or excessive doses of antigen resulting in low titer antisera. Small doses (ng level) of antigen administered 
at multiple intradermal sites appears to be most reliable. An effective immunization protocol for rabbits can be found in 
Vaitukaitis, J. et al. J. Clin. Endocrinol. Metab. 33:988-991 (1971). 
25 Booster injections can be given at regular intervals, and antiserum harvested when antibody titer thereof, as 

determined semi-quantitatively, for example, by double immunodiffusion in agar against known concentrations of the antigen, 
begins to fall. See, for example, Ouchterlony, 0. et al.. Chap. 19 in: Handbook of Experimental Immunology D. Wier (ed) 
Blackwell (1973). Plateau concentration of antibody is usually in the range of 0.1 to 0.2 mg/ml of serum (about 12 M). 
Affinity of the antisera for the antigen is determined by preparing competitive binding curves, as described, for example, by 
Fisher, D., Chap. 42 in: Manual of Clinical Immunology, 2d Ed. (Rose and Friedman, Eds.) Amer. Soc. For Microbiol., 
Washington, D.C. (1980). 

Antibody preparations prepared according to either protocol are useful in quantitative immunoassays which determine 
concentrations of antigen-bearing substances in biological samples; they are also used semi-quantitatively or qualitatively to 
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identify the presence of antigen in a biological sample. The antibodies can also be used in therapeutic compositions for killing 
bacterial cells expressing the protein. 

EXAMPLE 8 

Screening Chemical Libraries 

A. Protein-Based Assays 

Having isolated and expressed bacterial proteins shown to be required for bacterial proliferation, the present invention 
further contemplates the use of these expressed proteins in assays to screen libraries of compounds for potential drug 
candidates. The generation of chemical libraries is well known in the art. For example combinatorial chemistry can be used to 
generate a library of compounds to be screened in the assays described herein. A combinatorial chemical library is a collection 
of diverse chemical compounds generated by either chemical synthesis or biological synthesis by combining a number of 
chemical "building blocks" reagents. For example, a linear combinatorial chemical library such as a polypeptide library is formed 
by combining amino acids in every possible combination to yield peptides of a given length. Millions of chemical compounds 
theoretically can be synthesized through such combinatorial mixings of chemical building blocks. For example, one commentator 
observed that the systematic, combinatorial mixing of 100 interchangeable chemical building blocks results in the theoretical 
synthesis of 100 million tetrameric compounds or 10 billion pentameric compounds. (Gallop et al., "Applications of 
Combinatorial Technologies to Drug Discovery, Background and Peptide Combinatorial Libraries." Journal of Medicinal 
Chemistry, Vol. 37, No. 9, 1233-1250 (1994). Other chemical libraries known to those in the art may also be used, including 
natural product libraries. 

Once generated, combinatorial libraries can be screened for compounds that possess desirable biological properties. 
For example, compounds which may be useful as drugs or to develop drugs would likely have the ability to bind to the target 
protein identified, expressed and purified as discussed above. Further, if the identified target protein is an enzyme, candidate 
compounds would likely interfere with the enzymatic properties of the target protein. Any enzyme can be a target protein. For 
example, the enzymaric function of a target protein can be to serve as a protease, nuclease, phosphatase, dehydrogenase, 
transporter protein, transcriptional enzyme, and any other type of enzyme known or unknown. Thus, the present invention 
25 contemplates using the protein products described above to screen combinatorial chemical libraries. 

Those in the art will appreciate that a number of techniques exist for characterizing target proteins in order to identify 
molecules useful for the discovery and development of therapeutics. For example, some techniques involve the generation and 
use of small peptides to probe and analyze target proteins both biochemically and genetically in order to identify and develop 
drug leads. Such techniques include the methods described in PCT publications No. W09935494, W098 1 9 1 62, W09954728, 
30 the disclosures of which are incorporated herein by reference in their entireties. 

In another example, the target protein is a serine protease and the substrate of the enzyme is known. The present 
example is directed towards the analysis of libraries of compounds to identify compounds that function as inhibitors of the 
target enzyme. First, a library of small molecules is generated using methods of combinatorial, library formation well known in 
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the art. U.S. Patent NOs. 5,463,564 and 5,574, 656, to Agrafiotis, et aL, entitled "System and Method of Automatically 
Generating Chemical Compound with Desired Properties/ are two such teachings. Then the library compounds are screened to 
identify library compounds that possess desired structural and functional properties. U.S. Patent No. 5,684,71 1 also discusses 
a method for screening libraries. 

To illustrate the screening process, the combined target and chemical compounds of the library are exposed to and 
permitted to interact with the purified enzyme. A labeled substrate is added to the incubation. The label on the substrate is 
such that a detectable signal is emitted from metabolized substrate molecules. The emission of this signal permits one to 
measure the effect of the combinatorial library compounds on the enzymatic activity of target enzymes. The characteristics of 
each library compound is encoded so that compounds demonstrating activity against the enzyme can be analyzed and features 
common to the various compounds identified can be isolated and combined into future iterations of libraries. 

Once a library of compounds is screened, subsequent libraries are generated using those chemical building blocks that 
possess the features shown in the first round of screen to have activity against the target enzyme. Using this method, 
subsequent iterations of candidate compounds will possess more and more of those structural and functional features required 
to inhibit the function of the target enzyme, until a group of enzyme inhibitors with high specificity for the enzyme can be found. 
These compounds can then be further tested for their safety and efficacy as antibiotics for use in mammals. 

It will be readily appreciated that this particular screening methodology is exemplary only. Other methods are 
well known to those skilled in the art. For example, a wide variety of screening techniques are known for a large number 
of naturally-occurring targets when the biochemical function of the target protein is known. 
B. Cell Based Assays 

Current cell-based assays used to identify or to characterize compounds for drug discovery and development 
frequently depend on detecting the ability of a test compound to inhibit the activity of a target molecule located within a 
cell or located on the surface of a cell. Most often such target molecules are proteins such as enzymes, receptors and the 
like. However, target molecules may also include other molecules such as DNAs, lipids, carbohydrates and RNAs including 
messenger RNAs, ribosomal RNAs, tRNAs and the like. A number of highly sensitive cell-based assay methods are 
available to those of skill in the art to detect binding and interaction of test compounds with specific target molecules. 
However, these methods are generally not highly effective when the test compound binds to or otherwise interacts with 
its target molecule with moderate or low affinity. In addition, the target molecule may not be readily accessible to a test 
compound in solution, such as when the target molecule is located inside the cell or within a cellular compartment such as 
the periplasm of a bacterial cell. Thus, current cell-based assay methods are limited in that they are not effective in 
identifying or characterizing compounds that interact with their targets with moderate to low affinity or compounds that 
interact with targets that are not readily accessible. 

. Cell-based assay methods of the present invention have substantial advantages over current cell-based assays 
practiced in the art. These advantages derive from the use of sensitized cells in which the level or activity of a 
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proliferation-required gene product (the target molecule) has been specifically reduced to the point where the presence or 
absence of its function becomes a rate-determining step for cellular proliferation. Bacterial, fungal, plant, or animal cells 
can all be used with the present method. Such sensitized cells become much more sensitive to compounds that are active 
against the affected target molecule. Thus, cell-based assays of the present invention are capable of detecting compounds 
5 exhibiting low or moderate potency against the target molecule of interest because such compounds are substantially more 

potent on sensitized cells than on non-sensitized cells. The affect may be such that a test compound may be two to 
several times more potent, at least 10 times more potent or even at least 100 times more potent when tested on the 
sensitized cells as compared to the non-sensitized cells. 

Due in part to the increased appearance of antibiotic resistance in pathogenic microorganisms and to the 
0 significant side-effects associated with some currently used antibiotics, novel antibiotics acting at new targets are highly 

sought after in the art. Yet, another limitation in the current art related to cell-based assays is the problem of identifying 
hits against the same kinds of target molecules in the same limited set of biological pathways over and over again. This 
may occur when compounds acting at such new targets are discarded, ignored or fail to be detected because compounds 
acting at the "old" targets are encountered more frequently and are more potent than compounds acting at the new 
5 targets. As a result, the majority of antibiotics in use currently interact with a relatively small number of target molecules 

within an even more limited set of biological pathways. 

The use of sensitized ceils of the current invention provides a solution to the above problem in two ways. First, 
desired compounds acting at a target of interest, whether a new target or a previously known but poorly exploited target, 
can now be detected above the "noise" of compounds acting at the "old" targets due to the specific and substantial 
3 increase in potency of such desired compounds when tested on the sensitized cells of the current invention. Second, the 

methods used to sensitize cells to compounds acting at a target of interest may also sensitize these cells to compounds 
acting at other target molecules within the same biological pathway. For example, expression of an antisense molecule 
to a gene encoding a ribosomal protein is expected to sensitize the cell to compounds acting at that ribosomal protein and 
may also sensitize the cells to compounds acting at any of the ribosomal components (proteins or rRNA) or even to 
3 compounds acting at any target which. is part of the protein synthesis pathway. Thus an important advantage of the 

present invention is the ability to reveal new targets and pathways that were previously not readily accessible to drug 
discovery methods. 

Sensitized ceils of the present invention are prepared by reducing the activity or level of a target molecule. The 
target molecule may be a gene product, such as an RNA or polypeptide produced from the proliferation-required nucleic 
) acids described herein. Alternatively, the target may be a gene product such as an RNA or polypeptide which is produced 

form a sequence within the same operon as the proliferation-required nucleic acids described herein. In addition, the target 
may be an RNA or polypeptide in the same biological pathway as the proliferation-required nucleic acids described herein. 
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Such biological pathways include, but are not limited to, enzymatic, biochemical and metabolic pathways as well as 
pathways involved in the production of cellular structures such the cell wall. 

Current methods employed in the arts of medicinal and combinatorial chemistries are able to make use of 
structure-activity relationship information derived from testing compounds in various biological assays including direct 
5 binding assays and cell-based assays. Occasionally compounds are directly identified in such assays that are sufficiently 

potent to be developed as drugs. More often, initial hit compounds exhibit moderate or low potency. Once a hit compound 
is identified with low or moderate potency, directed libraries of compounds are synthesized and tested in order to identify 
more potent leads. Generally these directed libraries are combinatorial chemical libraries consisting of compounds with 
structures related to the hit compound but containing systematic variations including additions, subtractions and 
10 . substitutions of various structural features. When tested for activity against the target molecule, structural features are 
identified that either alone or in combination with other features enhance or reduce activity. This information is used to 
design subsequent directed libraries containing compounds with enhanced activity against the target molecule. After one 
or several iterations of this process, compounds with substantially increased activity against the target molecule are 
identified and may be further developed as drugs. This process is facilitated by use of the sensitized cells of the present 
15 invention since compounds acting at the selected targets exhibit increased potency in such cell-based assays, thus; more 

compounds can now be characterized providing more useful information than would be obtained otherwise. 

Thus, it is now possible using cell-based assays of the present invention to identify or characterize compounds 
that previously would not have been readily identified or characterized including compounds that act at targets that 
previously were not readily exploited using cell-based assays. The process of evolving potent drug leads from initial hit 
20 compounds is also substantially improved by the cell-based assays of the present invention because, for the same number 

of test compounds, more structure-function relationship information is likely to be revealed. 

The method of sensitizing a cell entails selecting a suitable gene or operon. A suitable gene or operon is one whose 
expression is required for the proliferation of the cell to be sensitized. The next step is to introduce into the cells to be 
sensitized, an antisense RNA capable of hybridizing to the suitable gene or operon or to the RNA encoded by the suitable 
25 gene or operon. Introduction of the antisense RNA can be in the form of an expression vector in which antisense RNA is 

produced under the control of an inducible promoter. The amount of antisense RNA produced is limited by varying the 
inducer concentration to which the cell is exposed and thereby varying the activity of the promoter driving transcription of 
the antisense RNA. Thus, cells are sensitized by exposing them to an inducer concentration that results, in a sub-lethal 
level of antisense RNA expression. 
30 ln one embodiment of the cell-based assays, the identified exogenous £ a?// nucleotide sequences of the present 

invention are used to inhibit the production of a proliferation-required protein. Expression vectors producing antisense RNA 
against identified genes required for proliferation are used to limit the concentration of a proliferation-required protein 
without severly inhibiting growth. To achieve that goal, a growth inhibition dose curve of inducer is calculated by plotting 
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various doses of inducer against the corresponding growth inhibition caused by the antisense expression. From this curve, 
various percentages of antisense induced growth inhibition, from 1 to 100% can be determined. If the promoter contained 
in the expression vector contains a lac operator the transcription is regulated by lac repressor and expression from the 
promoer is inducible with IPTG. For example, the highest concentration of the inducer IPTG that does not reduce the 
5 growth rate (0% growth inhibition) can be predicted from the curve. Cellular proliferation can be monitored by growth 

medium turbidity via OD measurements. In another example, the concentration of inducer that reduces growth by 25% 
can be predicted from the curve. In still another example, a concentration of inducer that reduces growth by 50% can be 
calculated. Additional parameters such as colony forming units (cfu) can be used to measure cellular viability. 

Cells to be assayed are exposed to the above-determined concentrations of inducer. The presence of the inducer at 
1 0 this sub lethal concentration reduces the amount of the proliferation required gene product to the lowest amount in the cell 

that will support growth. Cells grown in the presence of this concentration of inducer are therefore specifically more 
sensitive to inhibitors of the proliferation-required protein or RNA of interest or to inhibitors of proteins or FIN As in the same 
biological pathway as the proliferation-required protein or RNA of interest but not to inhibitors of unrelated proteins or 
RNAs. 

15 Cel,s Pleated with sub-inhibitory concentrations of inducer and thus containing a reduced amount of 

proliferation-required target gene product are then used to screen for compounds that reduce cell growth. The sub-lethal 
concentration of inducer may be any concentration consistent with the intended use of the assay to identify candidate 
compounds to which the cells are more sensitive. For example, the sub lethal concentration of the inducer may be such 
that growth inhibition is at least about 5%, at least about 8%, at least about 10%, at least about 20%, at least about 

20 30%, at least about 40%, at least about 50%, at least about 60% at least about 75%, or more. Cells which are pre- 

sensitized using the preceding method are more sensitive to inhibitors of the target protein because these cells contain less 
target protein to inhibit than wild-type cells. 

In another embodiment of the cell based assays of the present invention, the level or activity of a proliferation required 
gene product is reduced using a temperature sensitive ...mutation in the proliferation-required sequence and an antisense 

25 nucleic acid against the proliferation-required sequence. Growing the cells at an intermediate temperature between the 

permissive and restrictive temperatures of the temperature sensitive mutant where the mutation is in a proliferation- 
required gene produces cells with reduced activity of the proliferation-required gene product. The antisense RNA directed 
against the proliferation-required sequence further reduces the activity of the proliferation required gene product. Drugs 
that may not have been found using either the temperature sensitive mutation or the antisense nucleic acid alone may be 

30 identified by determining whether cells in which expression of the antisense nucleic acid has been induced and which are 

grown at a temperature between the permissive temperature and the restrictive temperature are substantially more 
sensitive to a test compound than cells in which expression of the antisense nucleic acid has not been induced and which 
are grown at a permissive temperature. Also drugs found previously from either the antisense nucleic acid alone or the 
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temperature sensitive mutation alone may have a different sensitivity profile when used in cells combining the two 
approaches, and that sensitivity profile may indicate a more specific action of the drug in inhibiting one or more activities 
of the gene product. 

Temperature sensitive mutations may be located at different sites within the gene and correspond to different 
domains of the protein. For example, the dnaB gene of Escherichia a?// encodes the replication fork DNA helicase. DnaB 
has several domains, including domains for oligomerization, ATP hydrolysis, DNA binding, interaction with primase, 
interaction with DnaC, and interaction with DnaA KBiswas, E.E. and Biswas, S.B. 1999. Mechanism and DnaB helicase of 
Escherichia co/i: structural domains involved in ATP hydrolysis, DNA binding, and oligomerization. Biochem. 38:10919- 
10928; Hiasa, H. and Marians, K.J. 1999. Initiation of bidirectional replication at the chromosomal origin is directed by the 
interaction between ,helicase and primase.. J. Biol. Chem. 274:27244-27248; San Martin, C, Radermacher, M., 
Wolpensinger, B. ( Engel, A., Miles, C.S., Dixon, N.E., and Carazo, J.M. 1998. Three-dimensional reconstructions from 
cryoelectron microscopy images reveal an intimate complex between helicase DnaB and its loading partner DnaC. 
Structure 6:501-9; Sutton, M.D., Carr, K.M., Vicente, M., and Kaguni, J.M. 1998. Escherichia coffDmk protein. The N- 
terminal domain and loading of DnaB helicase at the £ coii chromosomal. J. Biol. Chem. 273:34255-62.), the disclosures 
of which are incorporated herein by reference in their entireties]. Temperature sensitive mutations in different domains of 
DnaB confer different phenotypes at the restrictive temperature, which include either an abrupt stop or slow stop in DNA 
replication with or without DNA breakdown {Wechsler, J.A. and Gross, J.D. 1971. Escherichia co/i mutants temperature- 
sensitive for DNA synthesis. Mol. Gen. Genetics 1 13:273-284, the disclosure of which is incorporated herein by reference 
in its entirety) and termination of growth or cell death. Combining the use of temperature sensitive mutations in the dnaB 
gene that cause cell death at the restrictive temperature with an antisense to the dnaB gene could lead to the discovery of 
very specific and effective inhibitors of one or a subset of activities exhibited by DnaB. 

When screening for antimicrobial agents against a gene product required for proliferation, growth inhibition of cells 
containing a limiting amount of that proliferation-required gene product can be assayed. Growth inhibition can be 
measured by directly comparing the amount of growth, measured by the optical density of the growth medium, between an 
experimental sample and a control sample. Alternative methods for assaying cell proliferation include measuring green 
fluorescent protein (GFP) reporter construct emissions, various enzymatic activity assays, and other methods well known 
in the art. 

It will be appreciated that the above method may be performed in solid phase, liquid phase or a combination of 
the two. For example, cells grown on nutrient agar containing the inducer of the antisense construct may be exposed to 
compounds spotted onto the agar surface. A compound's effect may be judged from the diameter of the resulting killing 
zone, the area around the compound application point in which cells do not grow. Multiple compounds may be transferred 
to agar plates and simultaneously tested using automated and semi-automated equipment including but not restricted to 
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multi channel pipettes (for example the Beckman Multimek) and multi channel spotters (for example the Genomic Solutions 
Flexys). In this way multiple plates and thousands to millions of compounds may be tested per day. 

The compounds may also be tested entirely in liquid phase using microliter plates as described below. Liquid 
phase screening may be performed in microliter plates containing 96, 384. 1536 or more wells per microliter plate to 
5 screen multiple plates and thousands to millions of compounds per day. Automated and semi-automated equipment may be 

used for addition of reagents (for example cells and compounds) and determination of cell density. 

EXAMPLE 9 

The effectiveness of the above cell based assay was validated using constructs expressing antisense RNA to£ 
coli genes rpIL, rplJ, and rplW encoding ribosomal proteins L7/L12, L10 and L23 respectively. These proteins are part of 
1 0 the protein synthesis apparatus of the cell and as such are required for proliferation. These constructs were used to test 

the effect of antisense expression on cell sensitivity to antibiotics known to bind to the ribosome and thereby inhibit 
protein synthesis. Constructs expressing antisense RNA to several other genes (elaD, visC. yohH, and aptE/B), the 
products of which are not involved in protein synthesis were used for comparison. 

First expression vectors containing antisense constructs to either rplW or to elaD were introduced into separate 
1 5 £ coli cell populations. Vector introduction is a technique well known to those of ordinary skill in the art. The expression 

vectors of this example contain IPTG inducible promoters that drive the expression of Ihe antisense RNA in the presence of 
the inducer. However, those skilled in the art will appreciate that other inducible promoters may also be used. Suitable 
expression vectors are also well known in the art. The£ coli antisense clones encoding ribosomal proteins L7/L12. L10 
and L23 were used to test the effect of antisense expression on cell sensitivity to the antibiotics known to bind to these 
20 proteins. First, expression vectors containing antisense to either the genes encoding L7/L12 and L10 or L23 were 

introduced into separate E. coli cell populations. 

The cell populations were exposed to a range of IPTG concentrations in liquid medium to obtain the growlh 
inhibitory dose curve for each clone (Fig. 1). First, seed cultures were grown to a particular turbidity that is measured by 
the optical density (OD) of the growth solution. The OD of the solution is directly related to the number of bacterial cells 
25 contained therein. Subsequently, sixteen 200 ul liquid medium cultures were grown in a 96 well microliter plate at 37 C 

with a range of IPTG concentrations in duplicate two-fold serial dilutions from 1600 uM to 12.5 uM (final concentration). 
Additionally, control cells were grown in duplicate without IPTG. These cultures were started from equal amounts of cells 
derived from the same initial seed culture of a clone of interest. The cells were grown for up to 15 hours and the extent of 
growth was determined by measuring the optical density of the cultures at 600 nm. When the control culture reached mid- 
30 log phase the percent growth of the control for each of the IPTG containing cultures was plotted against the log 
concentrations of IPTG to produce a growth inhibitory dose response curve for the IPTG. The concentration of IPTG that 
inhibits cell growth to 50% (IC M ) as compared to the 0 mM IPTG control (0% growth inhibition) was then calculated from 
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the curve. Under these conditions, an amount of antisense RNA was produced that reduced the expression levels of rplW 
and elaD to a degree such that growth was inhibited by 50%. 

Alternative methods of measuring growth are also contemplated. Examples of these methods include 
measurements of proteins, the expression of which is engineered into the cells being tested and can readily be measured. 
5 Examples of such proteins include green fluorescent protein (GFP) and various enzymes. 

Cells were pretreated with the selected concentration of IPTG and then used to test the sensitivity of cell 
populations to tetracycline, erythromycin and other protein synthesis inhibitors. An example of a tetracycline dose 
response curve is shown in Figures 2A and 2B for the rplW and elaD genes, respectively. Cells were grown to log phase 
and then diluted into media alone or media containing IPTG at concentrations which give 20% and 50% growth inhibition 
10 as determined by IPTG dose response curves. After 2.5 hours, the cells were diluted to a final 0D60O of 0.002 into 96 

well plates containing (1)4-/. IPTG at the same concentrations used for the 2.5 hour preincubation; and (2) serial two-fold 
dilutions of tetracycline such that the final concentrations of tetracycline range from r ng/ml to 15.6 ng/mi and 0 ng/ml. 
The 96 well plates were incubated at 37°C and the OD600 was read by a plate reader every 5 minutes for up to 15 hours. 
For each IPTG concentration and the no IPTG control, tetracycline dose response curves were determined when the control 
15 (absence of tetracycline) reached 0.1 OD600. To compare tetracycline sensitivity with and without IPTG, tetracycline 

IC50s were determined from the dose response curves (Figs. 2A-B). Cells with reduced levels of L23 (rplW) showed 
increased sensitivity to tetracycline (Fig. 2A) as compared to cells with reduced levels of elaD (Fig. 2B). Figure 3 shows a 
summary bar chart in which the ratios of tetracycline IC50s determined in the presence of IPTG which gives 50% growth 
inhibition versus tetracycline IC50s determined without IPTG (fold increase in tetracycline sensitivity) were plotted. Cells 
20 with reduced levels of either L7/L12 (genes rpIL, rplJ) or L23 (rplW) showed increased sensitivity to tetracycline (Fig. 3). 

Cells expressing antisense to genes not known to be involved in protein synthesis {atpB/E. visC, elaD, yohH) did not show 
the same increased sensitivity to tetracycline, validating the specificity of this assay (Fig. 3). 

In addition to the above, it has been observed in initial experiments that clones expressing antisense RNA to 
genes involved in protein synthesis (including genes encoding ribosoma! proteins L7/L12 & L10, L7/L12 alone, L22, and 
25 L18, as well as genes encoding rRNA and Elongation Factor G) have increased sensitivity to the macrolide, erythromycin, 

whereas clones expressing antisense to the non-protein synthesis genes elaD, atpB/E and visC do not. Furthermore, the 
clone expressing antisense to rpIL and rplJ does not show increased sensitivity to nalidixic acid and ofloxacin, antibiotics 
which do not inhibit protein synthesis. 

The results with the ribosomal protein genes rpIL, rplJ, and rplW as well as the initial results using various other 
30 antisense clones and antibiotics show that limiting the concentration of an antibiotic target makes cells more sensitive to 

the antimicrobial agents that specifically, interact with that protein. The results also show that these cells are sensitized 
to antimicrobial agents that inhibit the overall function in which the protein target is involved but are not sensitized to 
antimicrobial agents that inhibit other functions. 
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The cell based assay described above may also be used to identify the biological pathway in which a 
proliferation-required nucleic acid or its gene product lies. In such methods, cells expressing a sublethal level of antisense 
to a target proliferation required nucleic acid and control cells in which expression of the antisense has not been induced 
are contacted with a panel of antibiotics known to act in various pathways. If the antibiotic acts in the pathway in which 
5 the -target proliferation-required nucleic acid or its gene product lies, cells in which expression of the antisense has been 

induced will be more sensitive to the antibiotic than cells in which expression of the antisense has not been induced. 

As a control, the results of the assay may be confirmed by contacting a panel of cells expressing antisense 
nucleic acids to many different proliferation-required genes including the target proliferation-required gene. If the antibiotic 
is acting specifically, heightened sensitivity to the antibiotic will be observed only in the cells expressing antisense to a 
10 target proliferation-required gene (or cells expressing antisense to other proliferation-required genes in the same pathway 

as the target proliferation-required gene) but will not be observed generally in all cells expressing antisense to proliferation- 
required genes. 

Similarly, the above method may be used to determine the pathway on which a test antibiotic acts. A panel of 
cells, each of which expresses antisense to a proliferation-required nucleic acid in a known pathway, is contacted with a 
15 compound for which it is desired to determine the pathway on which it acts. The sensitivity of the panel of cells to the 

test compound is determined in cells in which expression of the antisense has been induced and in control cells in which 
expression of the antisense has not been induced. If the test antibiotic acts on the pathway on which an antisense nucleic 
acid acts, cells in which expression of the antisense has been induced will be more sensitive to the antibiotic than cells in 
which expression of the antisense has not been induced. In addition, control cells in which expression of antisense to 
20 proliferation-required genes in other pathways has been induced will not exhibit heightened sensitivity to the antibiotic. In 

this way, the pathway on which the test antibiotic acts may be determined. 

The Example below provides one method for performing such assays. 

EXAMPLE 10 

Identification of thePathwav in which a Proliferation-Required 
2$ Gene Lies or the Pathway on which an Antibiotic Acts 

A. Preparation of Bacterial Stocks for Assay 

To provide a consistent source of cells to screen, frozen stocks of host bacteria containing the desired antisense 
construct are prepared using standard microbiological techniques. For example, a single clone of the organism can be 
isolated by streaking out a sample of the original stock onto an agar plate containing nutrients for cell growth and an 
30 antibiotic for which the antisense construct contains a gene which confers resistance. After overnight growth an isolated 

coiony is picked from the plate with a sterile needle and transferred to an appropriate liquid growth media containing the 
antibiotic required for maintenance of the plasmid. The cells are incubated at 30°C to 37°C with vigorous shaking for 4 to 
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6 hours to yield a culture in exponential growth. Sterile glycerol is added to 15% (volume to volume) and 100^L to 500 jaL 
aliquots are distributed into sterile cryotubes, snap frozen in liquid nitrogen, and stored at -80°C for future assays. 

B. Growth of Bacteria for Use in the Assay 

A day prior to an assay, a stock vial is removed from the freezer, rapidly thawed (37°C water bath) and a loop of 
5 culture is streaked out on an agar plate containing nutrients for cell growth and an antibiotic to which the antisense 

construct confers resistance. After overnight growth at 37°C, ten randomly chosen, isolated colonies are transferred from 
the plate (sterile inoculum loop) to a sterile tube containing 5 mL of LB medium containing the antibiotic to which the 
antisense vector confers resistance. After vigorous mixing to form a homogeneous cell suspension, the optical density of 
the suspension is measured at 600 nm (ODB00) and if necessary an aliquot of the suspension is diluted into a second tube 
10 of 5 mL, sterile, LB medium plus antibiotic to achieve an ODB00 < 0.02 absorbance units. The culture is then incubated at 

37° C for 1-2 hrs with shaking until the 0D600 reaches OD 0.2 - 0.3. At this point the cells are ready to be used in the 
assay. 

C. Selection of Media to be Used in Assay 

Two fold dilution series of the inducer are generated in culture media containing the appropriate antibiotic for 

15 maintenance of the antisense construct. Several media are tested side by side and three to four wells are used to evaluate 

the effects of the inducer at each concentration in each media. For example, M 9 minimal media, LB broth, TBD broth and 
Muller-Hinton media may be tested with the inducer IPTG at the following concentrations, 50 j^M, 100 jaM, 200 ^M, 400 
liM, 600 fiM, 800 and 1000 \xfA. Equal volumes of test media-inducer and cells are added to the wells of a 384 well 
microliter plate and mixed. The cells are prepared as described above and diluted 1:100 in the appropriate media 

20 containing the test antibiotic immediately prior to addition to the microtiter plate wells. For a control, cells are also added 

to several wells of each media that do not contain inducer, for example 0 M IPTG. Cell growth is monitored continuously 
by incubation at 37°C in a microtiter plate reader monitoring the OD600 of the wells over an 18-hour period. The percent 
inhibition of growth produced by each concentration of inducer is calculated by comparing the rates of logarithmic growth 
against that exhibited by cells growing in media without inducer. The medium yielding greatest sensitivity to inducer is 

25 . selected for use in the assays described below. 

D - Measurement of Test An tibiotic Sensitivity in the Absence of Antisense Construct Induction 

Two-fold dilution series of antibiotics of known mechanism of action are generated in the culture media selected 
for further assay development that has been supplemented with the antibiotic used to maintain the construct. A panel of 
test antibiotics known to act on different pathways is tested side by side with three to four wells being used to evaluate 

30 the effect of a test antibiotic on cell growth at each concentration. Equal volumes of test antibiotic and cells are added to 

the wells of a 384 well microtiter plate and mixed. Cells are prepared as described above using the media selected for 
assay development supplemented with the antibiotic required to maintain the antisense construct and are diluted 1:100 in 
identical media immediately prior to addition to the microtiter plate wells. For a control, cells are also added to several 
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wells that contain the solvent used to dissolve the antibiotics but no antibiotic. Cell growth is monitored continuously by 
incubation at 37°C in a microliter plate reader monitoring the OD600 of the wells over an 18-hour period. The percent 
inhibition of growth produced by each concentration of antibiotic is calculated by comparing the rates of logarithmic 
growth against that exhibited by cells growing in media without antibiotic. A plot of percent inhibition against 
5 loglantibiotic concentration) allows extrapolation of an IC 50 value for each antibiotic. 

E - Measurement of Test Antib iotic Sensitivity in the Presence of Antisense Construct Inducer 

The culture media selected for use in the assay is supplemented with inducer at concentrations shown to inhibit 
cell growth by 50 and 80% as described above and the antibiotic used to maintain the construct. Two fold dilution series 
of the panel of test antibiotics used above are generated in each of these media. Several antibiotics are tested side by side 
10 with three to four wells being used to evaluate the effects of an antibiotic on cell growth at each concentration, in each 

media. Equal volumes of test antibiotic and cells are added to the wells of a 384 well microliter plate and mixed. Cells are 
prepared as described above using the media selected for use in the assay supplemented with the antibiotic required to 
maintain the antisense construct. The cells are diluted 1:100 into two 50 mL aliquots of identical media containing 
concentrations of inducer that have been shown to inhibit cell growth by 50% and 80 % respectively and incubated at 
15 37°C with shaking for 2.5 hours. Immediately prior to addition to the microliter plate wells, the cultures are adjusted to 

an appropriate 00 600 (typically 0.002) by dilution into warm (37°C) sterile media supplemented with identical 
concentrations of the inducer and antibiotic used to maintain the antisense construct. For a control, cells are also added 
to several wells that contain solvent used to dissolve test antibiotics but which contain no antibiotic. Cell growth is 
monitored continuously by incubation at 37°C in a microliter plate reader monitoring the 0D600 of the wells over an 18- 
20 hour period. The percent inhibition of growth produced by each concentration of antibiotic is calculated by comparing the 

rates of logarithmic growth against that exhibited by cells growing in media without antibiotic. A plot of percent inhibition 
against loglantibiotic concentration] allows extrapolation of an IC 50 value for each antibiotic. 

F. Determining the Specificity of the Test Antibiotics 

A comparison of the IC 50 s generated by antibiotics of known mechanism of action under antisense induced and 
25 non-induced conditions allows the pathway in which a proliferation-required nucleic acid lies to be identified. If cells 

expressing an antisense nucleic acid against a proliferation-required gene are selectively sensitive to an antibiotic acting 
via a particular pathway, then the gene against which the antisense acts is involved in the pathway in which the antibiotic 
acts. 

G. Identification of Pathway in which a Test Antibiotic Acts 

30 As discussed above, the cell based assay may also be used to determine the pathway against which a test 

antibiotic acts. In such an analysis, the pathways against which each member of a panel of antisense nucleic acids acts 
are identified as described above. A panel of cells, each containing an inducible antisense vector against a gene in a 
known proliferation-required pathway, is contacted with a test antibiotic for which it is desired to determine the pathway 
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on which it acts under inducing an non-inducing conditions. If heightened sensitivity is observed in induced cells expressing 
antisense against a gene in a particular pathway but not in induced cells expressing antisense against genes in other 
pathways, then the test antibiotic acts against the pathway for which heightened sensitivity was observed. 

One skilled in the art will appreciate that further optimization of the assay conditions, such as the concentration 
5 of inducer used to induce antisense expression and/or the growth conditions used for the assay (for example incubation 

temperature and media components) may further increase the selectivity and/or magnitude of the antibiotic sensitization 
exhibited. 

The following example confirms the effectiveness of the methods described above. 

EXAMPLE 11 

10 Identificatio n of the Pathway in which a Proliferation-Required Gene Lips 

Antibiotics of various chemical classes and modes of action were purchased from Sigma Chemicals (St. Louis, 
MO). Stock solutions were prepared by dissolving each antibiotic in an appropriate aqueous solution based on information 
provided by the manufacturer. The final working solution of each antibiotic contained no more than 0.2% (w/v) of any 
organic solvent. To determine their potency against a bacterial strain engineered for expression of an antisense against a 

15 proliferation-required. 50S ribosomal protein, each antibiotic was serially diluted two or three fold in growth medium 

supplemented with the appropriate antibiotic for maintenance of the anti-sense construct. At least ten dilutions were 
prepared for each antibiotic. 25 jxL aliquots of each dilution were transferred, to discrete wells of a 384-well microplate 
(the assay plate) using a multi-channel pipette. Quadruplicate wells were used for each dilution of an antibiotic under each 
treatment condition (plus and minus inducer). Each assay plate contained twenty wells for cell growth controls (growth 

20 media replacing antibiotic), ten wells for each treatment (plus and minus inducer, in this example IPTG). Assay plates were 

usually divided into the two treatments: half the plate containing induced cells and an appropriate concentrations of 
inducer (in this example IPTG) to maintain the state of induction, the other half containing non-induced cells in the absence 
of IPTG. 

Cells for the assay were prepared as follows. Bacterial cells containing a construct, from which expression of 
25 antisense nucleic acid against rpIL and rpU, which encode proliferation-required 50S ribosomal subunit proteins, is 

inducible in the presence of IPTG, were grown into exponential growth (OD 600 0.2 to 0.3) and then diluted 1:100 into fresh 
media containing either 400 \iM or 0 nM inducer (IPTG). These cultures were incubated at 37° C for 2.5 nr. After a 2.5 
hr incubation, induced and non-induced cells were respectively diluted into an assay medium at a final 0D 60o value of 
0.0004. The medium contained an appropriate concentration of the antibiotic for the maintenance of the anti-sense 
construct. In addition, the medium used to dilute induced cells was supplemented with 800 IPTG so that addition to 
the assay plate would result in a final IPTG concentration of 400 yiM. Induced and non-induced cell suspensions were 
dispensed (25 ^l/well) into the appropriate wells of the assay plate as discussed previously. The plate was then loaded 
into a plate reader, incubated at constant temperature, and cell growth was monitored in each well by the measurement of 
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light scattering at 595 nm. Growth was monitored every 5 minutes until the cell culture attained a stationary growth 
phase. For each concentration of antibiotic, a percentage inhibition of growth was calculated at the time point 
corresponding to mid-exponential growth for the associated control wells (no antibiotic, plus or minus IPTG). For each 
antibiotic and condition (plus or minus IPTG). a plot of percent inhibition versus log of antibiotic concentration was 
generated and the IC50 determined. A comparison of the IC 50 for each antibiotic in the presence and absence of IPTG 
revealed whether induction of the antisense construct sensitized the cell to the mechanism of action exhibited by the 
antibiotic. Cells which exhibited a significant (standard statistical analysis) numerical decrease in the IC M value in the 
presence of inducer were considered to have an increased sensitivity to the test antibiotic. 

The results are provided in the table below, which lists the classes and names of the antibiotics used in the 
analysis, the . targets of the antibiotics, the IC50 in the absence of IPTG. the IC50 in the presence of IPTG, the 
concentration units for the IC50s, the fold increase in IC50 in the presence of IPTG, and whether increased sensitivity was 
observed in the presence of IPTG. 
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The above results demonstrate that induction of an antisense RNA to genes encoding 5QS ribosomal subunit 
proteins results in a selective and highly significant sensitization of cells to antibiotics that inhibit ribosomal function and 
protein synthesis. The above results further demonstrate that induction of an antisense construct to an essential gene 
sensitizes an organism to compounds that interfere with that gene products' biological role. This sensitization is restricted 
to compounds that interfere with pathways associated with the targeted gene and it's product. 

Assays utilizing antisense constructs to essential genes can be used to identify compounds that specifically 
interfere with the activity of multiple targets in a pathway. Such constructs can be used to simultaneously screen a 
sample against multiple targets in one pathway in one reaction (Combinatorial HTS). 

Furthermore, as discussed above, panels of antisense construct containing cells may be used to characterize the 
point of intervention of any compound affecting an essential biological pathway including antibiotics with no known 
mechanism of action. 

Another embodiment of the present invention is a method for determining the pathway against which a test 
antibiotic compound is active in which the activity of target proteins or nucleic acids involved in proliferation-required 
pathways is reduced by contacting cells with a sublethal concentration of a known antibiotic which acts against the 
target protein or nucleic acid. In one embodiment, the target protein or nucleic acid is a target protein or nucleic acid 
corresponding to a proliferation-required nucleic acid identified using the methods described above. The method is similar 
to those described above for determining which pathway a test antibiotic acts against except that rather than reducing the 
activity or level of a proliferation-required gene product using a sublethal level of antisense to a proliferation-required 
nucleic acid, the activity or level of the proliferation-required gene product is reduced using sublethal level of a known 
antibiotic which acts against the proliferation required gene product. 

Interactions between drugs which affect the same biological pathway has been described in the literature. For 
example, Mecillinam (Amdinocillin) binds to and inactivates the penicillin binding protein 2 (PBP2, product of the mrdA in £ 
co/A. This antibiotic inteacts with other antibiotics that inhibit PBP2 as well as antibiotics that inhibit other penicillin 
binding proteins such as PBP3 [(Gutmann, L, Vincent, S., Billot-Klein, D., Acar, J.F., Mrena, E., and Williamson, R. (1986) 
Involvement of penicillin-binding protein 2 with other penicillin-binding proteins in lysis of Escherichia co/fby some beta- 
lactam antibiotics alone and in synergistic lytic effect of amdinocillin (mecillinam). Antimicrobial Agents & Chemotherapy, 
30:906-912), the disclosure of which is incorporated herein by reference in its entirety]. Interactions between drugs 
could, therefore, involve two drugs that inhibit the same target protein or nucleic acid or inhibit different proteins or 
nucleic acids in the same pathway [(Fukuoka, T., Domon, H., Kakuta, M., Ishii, C. Hirasawa, A., Utsui, Y., Ohya, S., and 
Yasuda, H. (1997) Combination effect between panipenem and vancomycin on highly methicillin-resistant Staphylococcus 
aureus. Japan. J. Antibio. 50:411-419; Smith, C.E., Foleno, B.E., Barrett, J.F., and Frosc, M.B. (1997) Assessment of the 
synergistic interactions of levofloxacin and ampicillin against Enterococcus faecium by the checkerboard agar dilution and 
time-kill methods. Diagnos. Microbiol. Infect. Disease 27:85-92; den Hollander, J.G., Horrevorts, A.M., van Goor, M.L, 
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Verbrugh. H.A., and Mouton, J.W. (1997) Synergism between tobramycin and ceftazidime against a resistant 
Pseudomonas aeruginosa strain, tested in an in vitro pharmacokinetic model. Antimicrobial Agents & Chemotherapy. 
41:95-1 10), the disclosure of all of which are incorporated herein by reference in their entireties). 

Two drugs may interact even though they inhibit different targets. For example, the proton pump inhibitor, 
5 Omeprazole, and the antibiotic. Amoxycillin, two synergistic compounds acting together, can cure Helicobacter pylori 

infection [( Gabryelewicz, A., Laszewicz, W., Dzieniszewski, J., Ciok, J., Marlicz, K:, Bielecki, D., Popiela, T.. Legutko, J., 
Knapik, Z., Poniewierka, E. (1997) Multicenter evaluation of dual-therapy (omeprazol and amoxycillin) for Helicobacter 
/^//-associated duodenal and gastric ulcer (two years of the observation). J. Physiol. Pharmacol. 48 Suppl 4:93-105), the 
disclosure of which is incorporated herein by reference in its entirety]. 
10 The fl rowth inhibition from the sublethal concentration of the known antibiotic may be at least about 5%, at 

- least about 8%, at least about 10%, at least about 20%, at least about 30%, at least about 40%, at least about 50%, at 
least about 60%, or at least about 75%, or more. 

Alternatively, the sublethal concentration of the known antibiotic may be determined by measuring the activity of 
the target proliferation-required gene product rather than by measuring growth inhibition. 
15 Cells are con tacted with a combination of each member of a panel of known antibiotics at a sublethal level and 

varying concentrations of the test antibiotic. As a control, the cells are contacted with varying concentrations of the test 
antibiotic alone. The IC 50 of the test antibiotic in the presence and absence of the known antibiotic is determined. If the 
IC50s in the presence and absence of the known drug are substantially similar, then the test drug and the known drug act 
on different pathways. If the IC 50 s are substantially different, then the test drug and the known drug act on the same 
20 pathway. 

Another embodiment of the present invention is a method for identifying a candidate compound for use as an 
antibiotic in which the activity of target proteins or nucleic acids involved in proliferation-required pathways is reduced by 
contacting cells with a sublethal concentration of a known antibiotic which acts against the target protein or nucleic acid. 
In one embodiment, the target protein or nucleic acid is a target protein or nucleic acid corresponding to a proliferation- 

25 required nucleic acid identified using the methods described above. The method is similar to those described above for 

identifying candidate compounds for use as antibiotics except that rather than reducing the activity or level of a 
proliferation-required gene product using a sublethal level of antisense to a proliferation-required nucleic acid, the activity 
or level of the proliferation-required gene product is reduced using a sublethal level of a known antibiotic which acts 
against the proliferation required gene product. 

30 The 9 rowth inhibition from the sublethal concentration of the known antibiotic may be at least about 5%, at 

least about 8%. at least about 10%, at least about 20%, at least about 30%, at least about 40%, at least about 50%, at 
least about 60%, or at least about 75%, or more. 
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Alternatively, the sublethal concentration of the known antibiotic may be determined by measuring the activity of 
the target proliferation-required gene product rather than by measuring growth inhibition. 

In order to characterize test compounds of interest, cells are contacted with a panel of known antibiotics at a 
sublethal level and one or more concentrations of the test compound. As a control, the cells are contacted with the same 
concentrations of the test compound alone. The IC 50 of the test compound in the presence and absence of the known 
antibiotic is determined. If the IC 50 of the test compound is substantially different in the presence and absence of the 
known drug then the test compound is a good candidate for use as an antibiotic. As discussed above, once a candidate 
compound is identified using the above methods its structure may be optimized using standard techniques such as 
combinatoriai chemistry. 

Representative known antibiotics which may be used in each of the above methods are provided in the table 
below. However, it will be appreciated that other antibiotics may also be used. 



ANTIBIOTIC 



INHIBITS/TARGET 



RESISTANT MUTANTS 



Inhibitors of Transcription 

Rifamycin, 1959 Rifampicin 

Rifabutin Rifaximin 
Streptolydigin 

Streptovaricin 
Actinomycin D+EDTA 



Inhibits initiation of transcription/B-subunit RNA rpoB. crp, cyaA 
polymerase, rpoB 

Accelerates transcription chain termination/B- rpoB 
subunit RNA polymerase 

an acyclic ansamycin, inhibits RNA polymerase rpoB 
Intercalates between 2 successive 6-C pairs, pldA 
rpoB, inhibits RNA synthesis 



Inhibitors of Nucleic Acid Metabolism 

Quinolones, 1962 Nalidixic acid subunit gyrase and/or topoisomerase IV, gyrA 
Oxoiinic acid 

Fluoroquinolones Ciprofloxacin, subunit gyrase, gyrA and/or topoisomerase IV 
1983 Norfloxacin (probable target in Staph) 

Coumerins Novobiocin Inhibits ATPase activity of B-subunit gyrase, ■ 

gyrB 

Inhibits ATPase activity of B-subunit gyrase, 
gyrB 

DNA synthesis 

Causes single-strand breaks in DNA 



Coumermycin 

Albicidin 
Metronidazole 



Inhibitors of Metabolic Pathways 

Sulfonamides, 1932 Sulfanilamide blocks synthesis of dihydrofolate,dihydro- 

pteroate synthesis, foiP 
Trimethoprim, 1962 Inhibits dihydrofolate reductase, folA 

Showdomycin Nucleoside analogue capable of alkylating 



gyrAorB r icd, s/oB 
gyrA 

norA (efflux in Staph) 
hipd 

gyrB f cysB, cysE, nov, 

ompA 
gyrB, hisW 

tar (nucleoside channel) 
nar 



MP, gpt, pabA, pabB, 
pabC 
folA, thyA 
nupC, pnp 
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INHIBITS/TARGET 



RESISTANT MUTANTS 



sulfhydryl groups, inhibitor of thymidylate 
synthetase 

type I) fatty acid synthase inhibitor 



Adenosine glycoside antibiotic, target is GMP 
synthetase 

Inhibits fatty acid synthesis 

heterocyclic, contains boron, inhibit fatty acid 

synthesis, enoyi-ACP reductase, fab/ 



emrB 

fadB, emrB due to gene 

dosage 
guaA,B 

fablfenvM/ 
fablfenvM) 



Inhibitors of Translation 
Phenylpropanoids 

Chloramphenicol, 1947 

Tetracyclines, 1948, type II 

polyketides 

Minocycline 

Doxycyciine 
Macrolides (type I polyketides) 

Erythromycin, 1950 

Carbomycin, Spiramycin 

etc 

Aminoglycosides Streptomycin, 
1944 

Neomycin 

Spectinomycin Kanamycin 

Kasugamycin 

Gentamicin, 1963 

Amikacin 

Paromycin 
Lincosamides 

Lincomycin, 1955 Clindamycin 
Streptogramins Virginiamycin, 

1955 Pristinamycin 
Synercid: quinupristin 

/dalfopristin 
Fusidanes 

Fusidic Acid 
Kirromycin (Mocimycin) 



Binds to ribosomal peptidyl transfer center 
preventing peptide translocation/ binds to S6, 
L3, L6, 114, 116, 125, L26, L27, but 
preferentially to L16 

Binding to 30S ribosomal subunit, "A" site 
on 30S subunit, blocks peptide elongation, 
strongest binding to S7 

Binding to 50 S ribosomal subunit, 23S rRNA, 
blocks peptide translocation, L15, L4, L12 



Irreversible binding to 30S ribosomal subunit, 
prevents translation or causes mistranslation of 
mRNA/16S rRNA 



Binding to 50 S ribosomal subunit, blocks 
peptide translocation 

2 components, Streptogramins A&B, bind to 
the 50S ribosomal subunit blocking peptide 
translocation and peptide bond formation 

Inhibition of elongation factor G (EF-G) prevents 

peptide translocation 
Inhibition of elongation factor TU (EF-Tu), 
prevents peptide bond formation 



rrn, cm/A, marA, ompF, 
ompR 

clmA fcmrj, mar, ompF 



rrn, rpIC, rp/D, rpIV , mac 



rpsL, strCM ubiF 
atpA-E, ecfB, 

hemAC,D,E,G, topA, 
rps£D,E, rrn, spcB 
atpA-atpE, cpxA, ecfB, 

hemA,B,L, topA 
ksgA,B,C,D, rplBX 

rp$l,N,M,R 
rp/F, ubiF 
cpxA 
rpsL 

HnB, rplN,0, rpsG 



fusA 



tufA,B 
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ANTIBIOTIC 



INHIBITS/TARGET 



RESISTANT MUTANTS 



Pulvomycin . 
Thiopeptin 

Tiamulin 
Negamycin 

Oxazoiidinones Linezolid 
Isoniazid 

Nitrofurantoin 



Pseudomonic Acids Mupirocin 

(Bactroban) 
Indolmycin 
Viomycin 



Thiopeptides 

Thiostrepton 

Micrococcin 



Binds to and inhibits EF-TU 
Sulfur-containing antibiotic, inhibits protein 

synthesis,EF-G 
Inhibits protein synthesis 
Inhibits termination process of protein 

synthesis 
23S rRNA 



inhibits protein synthesis, nitroreductases 
convert nitrofurantoin to highly reactive 
electrophilic intermediates which attack 
bacterial ribosomal proteins non-specifically 

Inhibition of isoleucyl tRNA synthetase-used for 
Staph, topical cream, nasal spray 

Inhibits tryptophanyl-tRNA synthetase 



Binds toL11-23SRNA complex 
Inhibits GTP hydrolysis by EF-G 
Stimulates GTP hydrolysis by EF-G 



rplE 

rp/C, rpID 
prfB 



pdx 
nfnA,B 



HeS 
trpS 

rrmA (23S rRNA 
methyltransferase; 
mutant has slow 
growth rate, slow chain 
elongation rate, and 
viomycin resistance) 



Inhibitors of Cell Walls/Membranes 
fMactarns 

Penicillin, 1929 Ampicillin 

Methicillin, I960 



Cephalosporins, 1962 
Mecillinam (amdinocillin) 

Aztreonam (Furazlocillin) 
Bacilysin, Tetaine 
Glycopeptides Vancomycin, 1955 

Polypeptides Bacitracin 

Cyclic lipopeptide Daptomycin, 1980 



Inhibition of one or more cell wall 
transpeptidases, endopeptidases, and 
glycosidases (PBPs), of the 12 PBPs only 2 
are essential: mrdA (PBP2) and ftsl (pbpB, 
PBP3) 



Binds to and inactivates PBP2 [mrdA) 
Inactivates PBP3 [ftsf) 

Dipeptide, inhib glucosamine synthase 
Jnhib G+ cell wall syn, binds to terminal D- 
ala-D-ala of pentapeptide, 
Prevents dephosphorylation and regeneration 
of lipid carrier 

Disrupts multiple aspects of membrane 



ampC, ampD, ampE, 
envZ, galU. hipA, 
hipQ, ompC, ompF, 
ompR, ptsL rfa, to/D, 
tolE 

tonB 

alaS, argS, crp, cyaA, 
envB, mrdA,B, 
mreB,C,D 

dppA 



rfa 
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Cyclic polypeptides Polymixin, 1939 
Fosfomycin, 1969 

Cycloserine 
Alafosf alin 



function, including peptidoglycan synthesis, 
lipoteichoic acid synthesis, and the bacterial 
membrane potential 

Surfactant action disrupts cell membrane 
lipids, binds lipid A mioety of LPS 
Analogue of P-enolpyruvate, inhibits 1 st step 
in peptidoglycan synthesis • UDP-N- 
acetylglucosamine enoipyruvyl transferase, 
murA. Also acts as Immunosuppressant 
Prevents formation of D-ala dimer, inhibits D- 
ala ligase, ddlA.B 

phosphonodipeptide, cell wail synthesis 
inhibitor, potentiator of -lactams 



Inhibitors of Protein Processing/Transport 

Globomycin inhibits signal peptidase II (cleaves 

prolipoproteins subsequent to lipid 
modification, IspA 



pmrA 

murA, crp, cyaA glpT, 
hip A, ptsl, uhpT 



hipA, cycA 
pep A, tpp 



Ipp, dnaE 



20 



EXAMPLE 12 

Transfer of Exogenous Nucleic Acid Sequences to other Bacterial Species Usi n g the E cofflMSmn Vector, nr Fvp rp^inn 

Vectors Func tional in Bacterial Species other than£ coli. 
The above methods were validated using antisense nucleic acids which inhibit the growth of £ coll which were 
identified using methods similar to those described above. Expression vectors which inhibited growth of E. coli upon 
induction of antisense RNA expression with IPTG were transformed directly into Enterobacter cloacae, Klebsiella 
pneumonia or Salmonella typhimurium. The transformed cells were then assayed for growth inhibition according to the 
method of Example 1. After growth in liquid culture, cells were plated at various serial dilutions and a score determined by 
calculating the log difference in growth for INDUCED vs. UNINDUCED antisense RNA expression as determined by the 
maximum 10 fold dilution at which a colony was observed. The results of these experiments are listed below in Table VI. 
If there was no effect of antisense RNA expression in an organism, the clone is minus in Table VI. In contrast, a positive in 
Table VI means that at least 10 fold more cells were required to observe a colony on the induced plate than on the non- 
induced plate under the conditions used and in that organism. 

Sixteen of the construts were found to inhibit growth in ail the organisms tested upon induction of antisense 
RNA expression with IPTG. Those skilled in the art will appreciate that a negative result in a heterologous organism does 
not mean that that organism is missing that gene nor does it mean that the gene is unessential. However, a positive result 
means that the heterologous organism contains a homologous gene which is required for proliferation of that organism. 
The homologous gene may be obtained using the methods described herein. Those cells that are inhibited by antisense 
may be used in cell based assays as described herein for the identification and characterization of compounds in order to 
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develop antibiotics effective in these organisms. Those skilled in the art will appreciate that an.antisense molecule which 
works in the organism from which it was obtained will not always work in a heterologous organism. 



TABLE VI 

Sensitivity of Other Microorganisms to Antisense Nucleic Acids That Inhibit Proliferation inf. coii 



Mol. No. 


S. typhimurium 


£ cloacae 


K. pneumoniae 


EcXAOOl 


+ 


+ 


- 


EcXA004 


■ 


- 




EcXA005 


+ 




+ 


EcXA006 


- 


•- 


• 


EcXA007 


- 


+ 




ECXA008 


+ 


• 


+ 


EcXAOlO 


+ 


+ 


+ 


EcXAOl 1 


• 


+ 


• 


EcXAOl 2 


• 


+ 


• 


EcXAOl 3 


+ 


+ 


+ 


EcXAOl 4 


+ 


+ 


• 


EcXAOl 5 


• 


+ 


+ 


EcXAOl 6 


+ 


+ 


+ 


EcXAOl 7 


+ 


+ 


+ 


EcXAOl 8 


+ 


+ 


+ 


EcXA019 


+ 


+ 


+ 


EcXA020 


+ i 


+ 


+ 


CLAHUi 1 




+ 


+ 


EcXA023 


+ 


+ 




EcXA024 


+ 




+ 


EcXA025 








EcXA026 


+ 


+ 




EcXA027 


+ 


+ 


+ 


EcXA028 








EcXA029 
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Mol. No. 


S. typhimurium 


£ cloacae 


K. pneumoniae 


EcXAujU 






+ 


EcXAOd 1 


+ 


- 


• 


EcXA032 




• 


• 


EcXA033 


+ 


+ 


+ 


EcXA034 




+ 


+ 


EcXA035 


■ 


- 


• 


EcXA036 


+ 


- 




EcXA037 


• 


+ 


• 


EcXA038 


+ 


+ 


- 


« f— 1/ > Ann 

EcXA039 


+ 


• 


• 


EcXA041 




+ 




EcXA042 


- 


+ 




EcXA044 




- 




EcXA045 


• 


+ 




EcXA046 


- 


• 




EcXA047 




+ 




EcXA048 


- 






EcXA049 


+ 


■ 




EcXA050 


• 


- 




EcXA051 


+ 






EcXA052 




• 




LLAnUJJ 


+ 


+ 


+ 


EcXA054' 








EcXA055 


+ 







EXAMPLE 13 

Use of Identified Exogenous Nucleic Acid Sequences as Probes 
The identified sequence of the present invention can be used as probes to obtain the sequence of additional 
genes of interest from a second organism. For example, probes to potential bacterial target proteins may be hybridized to 
nucleic acids from other organisms including other bacteria and higher organisms, to identify homologous sequences. Such 
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hybridization might indicate that the protein encoded by the gene to which the probe corresponds is found in humans and 
therefore not necessarily a good drug target. Alternatively, the gene can be conserved only in bacteria and therefore 
would be a good drug target for a broad spectrum antibiotic or antimicrobial. 

Probes derived from the identified nucleic acid sequences of interest or portions thereof can be labeled with 
5 detectable labels familiar to those skilled in the art, including radioisotopes and non-radioactive labels, to provide a detectable 

probe. The detectable probe can be single stranded or double stranded and can be made using techniques known in the art, 
including in vitro transcription, nick translation, or kinase reactions. A nucleic acid sample containing a sequence capable of 
hybridizing to the labeled probe is contacted with the labeled probe. If the nucleic acid in the sample is double stranded, it can 
be denatured prior to contacting the probe. In some applications, the nucleic acid sample can be immobilized on a surface such 

10 as a nitrocellulose or nylon membrane. The nucleic acid sample can comprise nucleic acids obtained from a variety of sources, 

including genomic DNA, cDNA libraries, RNA, or tissue samples. 

Procedures used to detect the presence of nucleic acids capable of hybridizing to the detectable probe include well 
known techniques such as Southern blotting, Northern blotting, dot blotting, colony hybridization, and plaque hybridization. In 
some applications, the nucleic acid capable of hybridizing to the labeled probe can be cloned into vectors such as expression 

15 vectors, sequencing vectors, or in vitro transcription vectors to facilitate the characterization and expression of the hybridizing 

nucleic acids in the sample. For example, such techniques can be used to isolate, purify and clone sequences from a genomic 
library, made from a variety of bacterial species, which are capable of hybridizing to probes made from the sequences identified 
in Examples 5 and 6. 

EXAMPLE 14 

20 Preparation of PCR Primers and Amplification of DNA 

The identified E. coli genes corresponding directly to or located within the operon of nucleic acid sequences required 
for proliferation or portions thereof can be used to prepare PCR primers for a variety of applications, including the identification 
or isolation of homologous sequences from other species, for example S. typhimurium, £ cloacae, and Klebsiella pneumoniae, 
which contain part or all of the homologous genes. Because homologous genes are related but not identical in sequence, those 

25 ^ skilled in the art will often employ degenerate sequence PCR primers. Such degenerate sequence primers are designed based on 
conserved sequence regions, either known or suspected, such as conserved coding regions. The successful production of a PCR 
product using degenerate probes generated from the sequences identified herein would indicate the presence of a homologous 
gene sequence in the species being screened. The PCR primers are at least 10 bases, and preferably at least 20 bases in 
length. More preferably, the PCR primers are at least 20-30 bases in length. In some embodiments, the PCR primers can be 

30 more than 30 bases in length. It is preferred that the primer pairs have approximately the same G/C ratio, so that melting 

temperatures are approximately the same. A variety of PCR techniques are familiar to those skilled in the art. For a review of 
PCR technology, see Molecular Cloning to Genetic Engineering White, B.A. Ed. in Methods in Molecular Biology 67: Humana 
Press, Totowa 1997. When the entire coding sequence of the target gene is known, the 5' and 3' regions of the target gene 
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can be used as the sequence source for PCR probe generation. In each of these PCR procedures. PCR primers on either side of 
the nucleic acid sequences to be amplified are added to a suitably prepared nucleic acid sample along with dNTPs and a 
thermostable polymerase such as Taq polymerase, Pfu polymerase, or Vent polymerase. The nucleic acid in the sample is 
denatured and the PCR primers are specifically hybridized to complementary nucleic acid sequences in the sample. The 
5 hybridized primers are extended. Thereafter, another cycle of denaturation, hybridization, and extension is initiated. The cycles 

are repeated multiple times to produce an amplified fragment containing the nucleic acid sequence between the primer sites. 

EXAMPLE 15 
Inverse PCR 

The technique of inverse polymerase chain reaction can be used to extend the known nucleic acid sequence identified 
10 in Examples 5 and 6. The inverse PCR reaction is described generally by Ochman et al., in Ch. 10 of PCR Technology: 

Principles and Applications for DNA Amplification, (Henry A. Erlich, Ed.) W.H. Freeman and Co. (1992). Traditional PCR 
requires two primers that are used to prime the synthesis of complementary strands of DNA. In inverse PCR, only a core 
sequence need be known. 

Using the sequences identified as relevant from the techniques taught in Examples 5 and 6 and applied to other 
15 species of bacteria, a subset of exogenous nucleic sequences are identified that correspond to genes or operons that are 

required for bacterial proliferation. In species for which a genome sequence is not known, the technique of inverse PCR 
provides a method for obtaining the gene in order to determine the sequence or to place the probe sequences in full context to 
the target sequence to which the identified exogenous nucleic acid sequence binds. 

To practice this technique, the genome of the target organism is digested with an appropriate restriction enzyme so 
20 as to create fragments of nucleic acid that contain the identified sequence as well as unknown sequences that flank the 

identified sequence. These fragments are then circularized and become the template for the PCR reaction. PCR primers are 
designed in accordance with the teachings of Example 15 and directed to the ends of the identified sequence are synthesized. 
The primers direct nucleic acid synthesis away from the known sequence and toward the unknown sequence contained within 
the circularized template. After the PCR reaction is complete, the resulting PCR products can be sequenced so as to extend the 
25 sequence of the identified gene past the core sequence of the identified exogenous nucleic acid sequence identified, in this 

manner, the. full sequence of each novel gene can be identified. Additionally the sequences of adjacent coding and noncoding 
regions can be identified. 

EXAMPLE 16 

Identification of Genes Required for Staphylococcus aureus Proliferation 
Genes required for proliferation in Staphylococcus aureus are identified according to the methods described above. 

EXAMPLE 17 

Identification of Genes Required for Neisseria gonorrhoeae Proliferation 
Genes required for proliferation in Neisseria gonorrhoeae are identified according to the methods described above. 
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EXAMPLE 18 

Identification of Gen es Required for Pseudomonas aeruginosa Proliferation 
Genes required for proliferation in Pseudomonas aeruginosa are identified according to the methods described 

above. 

5 EXAMPLE 19 

Identification of Genes Required for Enterococcus faecah's Proliferation 
Genes required for proliferation in Enterococcus faecaiis are identified according to the methods described above. 

EXAMPLE 20 

Identificatio n of Genes Required for Haemophilus influenzae Proliferation 
1 0 Genes squired for proliferation in Haemophilus influenzae are identified according to the methods described above. 

EXAMPLE 21 

Identification of Genes Required for Salmonella typhimurium Proliferation 
Genes required for proliferation in Salmonella typhimurium are identified according to the methods described above. 

EXAMPLE 22 

15 Identification of Genes Required for Helicobacter pylori Proliferation 

Genes required for proliferation in Helicobacter pylori zxz identified according to the methods described above. 

EXAMPLE 23 

Identifica tion of Genes Required for Mycoplasma pneumoniae Proliferation 
Genes required for proliferation in Mycoplasma pneumoniae are identified according to the methods described 

20 above. 

EXAMPLE 24 

Identification of Genes Required for Plasmodium ovale Proliferation 
Genes required for proliferation in Plasmodium ovale are identified according to the methods described above. 

EXAMPLE 25 

25 Identificatio n of Genes Required for Saccharomvces cerevisiae Proliferation 

Genes required for proliferation in Saccharomyces cerevisiae are identified according to the methods described- 
above. 

EXAMPLE 26 

Identification of Genes Required for Entamoeba histolytica Proliferation 
30 Genes squired for proliferation in Entamoeba histolytica are identified according to the methods described above. 

EXAMPLE 27 

Identification of Genes Required for Candida albicans Proliferation 
Genes required for proliferation in Candida albicans are identified according to the methods described above. 
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EXAMPLE 28 

Identification of Genes Required for Klebsiella pneumoniae Proliferation 
Genes required for proliferation in Klebsiella pneumoniae are identified according to the methods described above. 

EXAMPLE 29 

5 Identification of Genes Required for Salmonella typhi Proliferation 

Genes required for proliferation in Salmonella typhi are identified according to the methods described above. 

EXAMPLE 30 

Identification of Genes Required for Salmonella paratyphi Proliferation 
Genes required for proliferation in Salmonella paratyphi are identified according to the methods described above. 
10 EXAMPLE 31 

Identification of Genes Required for Salmonella cholerasuis Proliferation 
Genes required for proliferation in Salmonella cholerasuis are identified according to the methods described above. 

EXAMPLE 32 

Identification of Genes Required for Staphylococcus epidermis Proliferation 

15 Genes ret P red for proliferation in Staphylococcus epidermis are identified according to the methods described 

above. 

EXAMPLE 33 

Identification of Genes Required for Mycobacterium tuberculosis Proliferation 
Genes required for proliferation ^Mycobacterium tuberculosis are identified according to the methods described 

20 above. 

EXAMPLE 34 

Identification of Genes Required for Mycobacterium leprae Proliferation 
Genes required for proliferation in Mycobacterium leprae are identified according to the methods described above. 

EXAMPLE 35 

25 Identification of Genes Required for Treponema pallidum Proiiferation 

Genes required for proliferation in Treponema pallidum are identified according to the methods described above. 

EXAMPLE 36 

Identification of Genes Required for Bacillus anthracis Proliferation 
Genes required for proliferation in Bacillus anthracis are identified according to the methods described above. 
30 EXAMPLE 37 

Identification of Genes Required for Yersinia pestis Proliferation 
Genes required for proliferation in Yersinia pestis are identified according to the methods described above. 
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EXAMPLE 3B 

Identification of Genes Required for Clostridium botulinum Proliferation 
Genes required for proliferation in Clostridium botuiinum are identified according to the methods described above. 

EXAMPLE 39 

5 Identification of Genes Required for Campylobacter jejuni Proliferation 

Genes required for proliferation in Campylobacter jejuni are identified according to the methods described above. 

EXAMPLE 40 

Identification of Genes Required for Chlamydia trachomatis Proliferation 
Genes required for proliferation in Chlamydia trachomatis are identified according to the methods described above. 
1 0 Use of Isolated Exogenous Nucleic Acid Fragments as Antisense Antibiotics 

In addition to using the identified sequences to enable screening of molecule libraries to identify compounds useful to 
identify antibiotics, the sequences themselves can be used as therapeutic agents. Specifically, the identified exogenous 
sequences in an antisense orientation can be provided to an individual to inhibit the translation of a bacterial target gene. 
Generation of Antisense Therapeutics from Identified Exogenous Sequences 
15 The sequences of the present invention can be used as antisense therapeutics for the treatment of bacterial 

infections or simply for inhibition of bacterial growth in vitro or in vivo. The therapy exploits the biological process in cells 
where genes are transcribed into messenger RNA (mRNA) that is then translated into proteins. Antisense RNA technology 
contemplates the use of antisense oligonucleotides directed against a target gene that will bind to its target and decrease 
or inhibit the translation of the target mRNA. In one embodiment, antisense oligonucleotides can be used to treat and 
20 control a bacterial infection of a cell culture containing a population of desired cells contaminated with bacteria. In 

another embodiment, the antisense oligonucleotides can be used to treat an organism with a bacterial infection. 

Antisense oligonucleotides can be synthesized from any of the sequences of the present invention using methods 
well. known in the art. In a preferred embodiment, antisense oligonucleotides are synthesized using artificial means. 
Uhlmann & Peymann, Chemical Rev. 90:543-584 (1990) review antisense oligonucleotide technology in detail. Modified or 
25 ^ unmodified antisense oligonucleotides can be used as therapeutic agents. Modified antisense oligonucleotides are 
preferred since it is well known that antisense oligonucleotides are extremely unstable. Modification of the phosphate 
backbones of the antisense oligonucleotides can be achieved by substituting the internucleotide phosphate residues with 
methylphosphonates, phosphorothioates, phosphoramidates, and phosphate esters. Nonphosphate internucleotide analogs 
such as siloxane bridges, carbonate brides, thioester bridges, as well as many others known in the art. The preparation of 
30 certain antisense oligonucleotides with modified internucleotide linkages is described in U.S. Patent No. 5,142,047, hereby 

incorporated by reference. 

Modifications to the nucleoside units of the antisense oligonucleotides are also contemplated. These 
modifications can increase the half-life and increase cellular rates of uptake for the oligonucleotides in vivo. For example. 
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a-anomeric nucleotide units and modified bases such as 1,2-dideoxy-d-ribofuranose, 1,2-dideoxy-1.phenylribofuranose, and 
ft, A^-ethano-5-methyl-cytosine are contemplated for use in the present invention. 

An additional form of modified antisense molecules is found in peptide nucleic acids. Peptide nucleic acids (PNA) 
have been developed to hybridize to single and double stranded nucleic acids. PNA are nucleic acid analogs in which the entire 
deoxyribose-phosphate backbone has been exchanged with a chemically completely different, but structurally homologous, 
polyamide (peptide) backbone containing 2-aminoetbyl glycine units. Unlike DNA, which is highly negatively charged, the PNA 
backbone is neutral. Therefore, there is much less repulsive energy between complementary strands in a PNA-DNA hybrid than 
in the comparable DNA-DNA hybrid, and consequently they are much more stable. PNA can hybridize to DNA in either a 
Watson/Crick or Hoogsteen fashion (Demidov et a!., Proc. Natl. Acad. Sci. U.S.A. 92:2637-2641, 1995; Egholm, Nature 
365:566-568, 1993; Nielsen et al., Science 254:1497-1500, 1991; Dueholm et al. # New J. Chem. 21:19 31, 1997). 

Molecules called PNA "clamps" have been synthesized which have two identical PNA sequences joined by a flexible 
hairpin linker containing three 8-amino-3,6-dioxaoctanoic acid units. When a PNA clamp is mixed with a complementary 
homopurine or homopyrimidine DNA target sequence, a PNA-DNA-PNA triplex hybrid can form which has been shown to be 
extremely stable (Bentin et al., Biochemistry. 35:8863-8869, 1996; Egholm et a!.. Nucleic Acids Res. 23:217-222, 1995; 
Griffith et al., J. Am. Chem. Soc. 1 17:831-832, 1 995). 

The sequence-specific and high affinity duplex and triplex binding of PNA have been extensively described (Nielsen et 
al., Science 254:1497-1500, 1991; Egholm et al., J. Am. Chem. Soc. 1 14:9677-9578, 1992; Egholm et al., Nature 365:586- 
568, 1993; Almarsson et al., Proc. Natl. Acad. Sci. U.SJl. 90:9542-9546, 1993; Demidov et aL, Proc. Natl Acad Sci. U.SJl. 
92:2637-2641, 1995). They have also been shown to be resistant to nuclease and protease digestion (Demidov et aL, 
Biochem. Pharm. 48:1010-1313, 1994). PNA has been used to inhibit gene expression (Hanvey et al., Science 258:1481- 
1485,1992; Nielsen et al., NucL Acids. Res.. 21:197-200, 1993; Nielsen et al., Gene 149:139-145, 1994; Good & Nielsen, 
Science, 95: 2073-2076, 1998; all of which are hereby incorporated by reference), to block restriction enzyme activity 
(Nielsen et al., supra., 1993), to act as an artificial transcription promoter (Mollegaard, Proc. Natl. Acad. Sci. U.S.A. 91:3892- 
3895, 1994) and as a pseudo restriction endonuclease (Demidov et al., NucL Acids. Res. 21:2103-2107, 1993). Recently, PNA 
has also been shown to have antiviral and antitumoral activity mediated through an antisense mechanism (Norton, Nature 
BiotechnoL, 14:615-619, 1996; Hirschman et al., J. Investig. Med 44:347-351, 1996). PNAs have been linked to various 
peptides in order to promote PNA entry into ceils (Basu et al., Bioconj. Chem. 8:481-488, 1997; Pardridge et al., Proc. Natl. 
Acad. Sci. U.S.A. 92:5592-5596, 1 995). 

The antisense oligonucleotides contemplated by the present invention can be administered by direct application 
of oligonucleotides to a target using standard techniques well known in the art. The antisense oligonucleotides can be 
generated within the target using a plasmid, or a phage. Alternatively, the antisense nucleic acid may be expressed from a 
sequence in the chromosome of the target cell. It is further contemplated that contemplated that the antisense 
oligonucleotide contemplated are incorporated in a ribozyme sequence to enable the antisense to specifically bind and cleave its 
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target mRNA. For technical applications of ribozyme and antisense oligonucleotides see Rossi et al., Pharmacol. Ther. 
50(21:245-254. (1991), which is hereby incorporated by reference. The present invention also contemplates using a retron to 
introduce an antisense oligonucleotide to a cell. Retron technology is exemplified by U.S. Patent No. 5,405,775, which is 
hereby incorporated by reference. Antisense oligonucleotides can also be delivered using liposomes or by electroporation 
5 techniques which are well known in the art. 

The antisense nucleic acids of the present invention can also be used to design antibiotic compounds comprising 
nucleic acids which function by intracellular triple helix formation. Triple helix oligonucleotides are used to inhibit transcription 
from a genome. The sequences identified as required for proliferation in the present invention, or portions thereof, can be used 
as templates to inhibit microorganism gene expression in individuals infected with such organisms. Traditionally, homopurine 
10 sequences were considered the most useful for triple helix strategies. However, homopyrimidine sequences can also inhibit 

gene expression. Such homopyrimidine oligonucleotides bind to the major groove at homopurinerhomopyrimidine sequences. 
Thus, both types of sequences based on the sequences of the present invention that are required for proliferation are 
contemplated for use as antibiotic compound templates. 

The antisense oligonucleotides of this example employ the identified sequences of the present invention to induce 
1 5 bacterial cell death or at least bacterial stasis by inhibiting target gene translation. Antisense oligonucleotides containing 

from about 8 to 40 bases of the sequences of the present invention have sufficient complementary to form a duplex with 
the target sequence under physiological conditions. 

To kill bacterial cells or inhibit their growth, the antisense oligonucleotides are applied to the bacteria or to the 
target cells under conditions that facilitate their uptake. These conditions include sufficient incubation times of cells and 
20 oligonucleotides so that the antisense oligonucleotides are taken up by the cells. In one embodiment, an incubation period 

of 7-10 days is sufficient to kill bacteria in a sample. An optimum concentration of antisense oligonucleotides is selected 
for use. 

The concentration of antisense oligonucleotides to be used can vary depending on the type of bacteria sought to 
be controlled, the nature of the antisense oligonucleotide to be used, and the relative toxicity of the antisense 
25 oligonucleotide to the desired cells in the treated culture. Antisense oligonucleotides can be introduced to cell samples at a 

number of different concentrations preferably between 1x10 ,0 M to lx10"M. Once the minimum concentration that can 
adequately control gene expression is identified, the optimized dose is translated into a dosage suitable for use in vivo. For 
example, an inhibiting concentration in culture of 1x10' translates into a dose of approximately 0.6 mg/kg body weight. Levels 
of oligonucleotide approaching 100 mg/kg body weight or higher may be possible after testing the toxicity of the oligonucleotide 
in laboratory animals. It is additionally contemplated that cells from the subject are removed, treated with the antisense 
oligonucleotide, and reintroduced into the subject. This range is merely illustrative and one of skill in the art are able to 
determine the optimal concentration to be used in a given case. 
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After the bacterial cells have been killed or controlled in a desired culture, the desired cell population may be used 
for other purposes. 

EXAMPLE 41 

The following example demonstrates the ability of an £ coli antisense oligonucleotide to act as a bactericidal or 
5 bacteriostatic agent to treat a contaminated cell culture system. The application of the antisense oligonucleotides of the 

present invention are thought to inhibit the translation of bacterial gene products required for proliferation. 

The antisense oligonucleotide of this example corresponds to a ,30 base phophorothioate modified 
oligodeoxynucelotide complementary to a nucleic acid involved in proliferation, such as Molecule Number EcXAOOL A 
sense oligodeoxynucelotide complementary to the antisense sequence is synthesized and used as a control. The 
10 oligonucleotides are synthesized and purified according to the procedures of Matsukura, et a!., Gene 72:343 (1988). The 

test oligonucleotides are dissolved in a small volume of autoclaved water and added to culture medium to make a 100 
micromolar stock solution. 

Human bone marrow cells are obtained from the peripheral blood of two patients and cultured according standard 
procedures well known in the art. The culture is contaminated with the K-12 strain of £ coli and incubated at 37°C 
1 5 overnight to establish bacterial infection. 

The control and antisense oligonucleotide containing solutions are added to the contaminated cultures and 
monitored for bacterial growth. After a 10 hour incubation of culture and oligonucleotides, samples from the control and 
experimental cultures are drawn and analyzed for the translation of the target bacterial gene using standard 
microbiological techniques well known in the art. The target £ coli gene is found to be translated in the control culture 
20 treated with the control oligonucleotide, however, translation of the target gene in the experimental culture treated with 

the antisense oligonucleotide of the present invention is not detected or reduced. 

EXAMPLE 42 

A subject suffering from an £ co// infection is treated with the antisense oligonucleotide preparation of Example 
39. The antisense oligonucleotide is provided in a pharmaceutically acceptable carrier at a concentration effective to 
25 inhibit the translation of the target gene. The present subject is treated with a concentration of antisense oligonucleotide 

sufficient to achieve a blood concentration of about 100 micromolar. The patient receives daily injections of antisense 
oligonucleotide to maintain this concentration for a period of 1 week. At the end of the week a blood sample is drawn and 
analyzed for the presence or absence using standard techniques well known in the art. There is no detectable evidence of 
E. coli and the treatment is terminated. 

30 EXAMPLE 43 

Preparation and use of Triple Helix Probes 
The sequences of microorganism genes required for proliferation of the present invention are scanned to identify 10- 
mer to 20-mer homopyrimidine or homopurine stretches that could be used in triple-helix based strategies for inhibiting gene 
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expression. Following identification of candidate homopyrimidine or homopurine stretches, their efficiency in inhibiting gene 
expression is assessed by introducing varying amounts of oligonucleotides containing the candidate sequences into a population 
of bacterial cells that normally express the target gene. The oligonucleotides may be prepared on an. oligonucleotide synthesizer 
or they may be purchased commercially from a company specializing in custom oligonucleotide synthesis, such as GENSET, 
5 Paris, France. 

The oligonucleotides can be introduced into the cells using a variety of methods known to those skilled in the art, 
including but not limited to calcium phosphate precipitation, DEAE-Dextran, eiectroporation, liposome-mediated transfection or 
native uptake. 

Treated cells are monitored for a reduction in proliferation using techniques such as monitoring growth levels as 
10 compared to untreated cells using optical density measurements. The oligonucleotides that are effective in inhibiting gene 

expression in cultured cells can then be introduced in vivo using the techniques well known in that art at a dosage level shown 
to be effective. 

In some embodiments, the natural (beta) anomers of the oligonucleotide units can be replaced with alpha anomers to 
render the oligonucleotide more resistant to nucleases. Further, an intercalating agent such as ethidium bromide, or the like, can 
15 be attached to the 3 end of the alpha oligonucleotide to stabilize the triple helix. For information on the generation of 

oligonucleotides suitable for triple helix formation see Griffin et at. (Science 245:967-971 (1 989), which is hereby incorporated 
by this reference). 

EXAMPLE 44 

Identification of Bacterial Strains from Isolated Specimens bv PCR 
20 Classical bacteriological methods for the detection of various bacterial species are time consuming and costly. 

These methods include growing the bacteria isolated from a subject in specialized media, cultivation on selective agar 
media, followed by a set of confirmation assays that can take from 8 to 10 days or longer to complete. Use of the 
identified sequences of the present invention provides a method to dramatically reduce the time necessary to detect and 
identify specific bacterial species present in a sample. 
25 ln one exemplary method, bacteria are grown in enriched media and DNA samples are isolated from specimens of, for 

example, blood, urine, stool, saliva or central nervous system fluid by conventional methods. A panel of PCR primers based on 
identified sequences unique to various species of microorganisms are then utilized in accordance with Example 12 to amplify 
DNA of approximately 100-200 bases in length from the specimen. A separate PCR reaction is set up for each pair of PCR 
primers and after the PCR reaction is complete, the reaction mixtures are assayed for the presence of PCR product. The 
presence or absence of bacteria from the species to which the PCR primer pairs belong is determined by the presence or 
absence of a PCR product in the various test PCR reaction tubes. 

Although the PCR reaction is used to assay the isolated sample for the presence of various bacterial species, other 
assays such as the Southern blot hybridization are also contemplated. 
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WHAT IS CLAIMED IS : 

1. A purified or isolated nucleic acid sequence consisting essentially of one of SEQ ID NOs: 405*485, 
wherein said nucleic acid inhibits microorganism proliferation. 

2. The nucleic acid sequence of Claim 1, wherein said nucleic acid sequence is complementary to at least 
5 a portion of a coding sequence of a gene whose expression is required for microorganism proliferation. 

3. The nucleic acid sequence of Claims 1 or 2, wherein said nucleic acid comprises a fragment of one of 
SEQ ID NOs. 405-485, said fragment selected from the group consisting of fragments comprising at least 10, at least 20, 
at least 25, at least 30, at least 50 or more than 50 consecutive bases of one of SEQ ID NOs: 405-485. 

4. The nucleic acid sequence of Claim 3, wherein said nucleic acid sequence is complementary to a coding 
1 0 sequence of a gene whose expression is required for microorganism proliferation. 

5. A vector comprising a promoter operably linked to a nucleic acid comprising a sequence selected from 
the group consisting of SEQ ID NOs. 405-485. 

6. The vector of Claim 5, wherein said promoter is active in an organism selected from the group 
consisting of Escherichia coll Staphylococcus aureus, Pseudomonas aeruginosa, Enterobacter cloacae, Helicobacter pylori, 

15 Neisseria gonorrhoeae, Enterococcus faecalis, Streptococcus pneumoniae, Haemophilus influenzae. Salmonella 

typhimurium, Saccharomyces cerevisiae, Candida albicans, Crypt ococcus neoformans, Aspergillus fumigates, Klebsiella 
pneumoniae, Salmonella typhi. Salmonella paratyphi, Salmonella cholerasuis, Staphylococcus epidermidis, Mycobacterium 
tuberculosis, Mycobacterium leprae, Treponema pallidum, Bacillus anthracis, yersinia pestis, Clostridium botulinum, 
Campylobacter jejuni, Chlamydia trachomatus, Chlamydia pneumoniae or any species falling within the genera of any of the 

20 above species. 

7. A host cell containing the vector of Claim 5 or Claim 6. 

8. A purified or isolated nucleic acid consisting essentially of the coding sequence of one of SEQ ID NOs: 
82-88, 90-242. 

9. A fragment of the nucleic acid of Claim 8, said fragment comprising at least 10, at least 20, at least 
25 25, at least 30, at least 50 or more than 50 consecutive bases of one of SEQ ID NOs: 82-88, 90-242. 

10. A vector comprising a promoter operably linked to the nucleic acid of Claim 8 or Claim 9. 

11. A purified or isolated nucleic acid comprising a nucleic acid sequence complementary to at least a 
portion of an intragenic sequence, intergenic sequence, sequences spanning at least a portion of two or more genes, 5' 
noncoding region, or 3' noncoding region within an operon encoding a polypeptide comprising a sequence selected from the 

30 group consisting of SEQ ID NOs: 243-357, 359-398. 

12. A purified or isolated nucleic acid comprising a nucleic acid having at least 70% homology to a 
sequence selected from the group consisting of SEQ ID NOs 405-485, 82-88, 90-242 or the sequences complementary 
thereto as determined using BLASTN version 2.0 with the default parameters. 
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13. The nucieic acid of Claim 12, wherein said nucleic acid is from an organism selected from the group 
consisting of Staphylococcus aureus, Pseudomonas aeruginosa, Enterobacter cloacae, Helicobacter pylori, Neisseria 
gonorrhoeae, Enterococcus faecaiis, Streptococcus pneumoniae, Haemophilus influenzae, Salmonella typhimurium, 
Saccharomyces cerevisiae, Candida albicans, Cryptococcus neoformans, Aspergillus fumigatus, Klebsiella pneumoniae, 

5 Salmonella typhi, Salmonella paratyphi] Salmonella cholerasuis, Staphylococcus epidermidis, Mycobacterium tuberculosis, 

Mycobacterium leprae, Treponema pallidum. Bacillus anthracis, Yersinia pest is, Clostridium botulinum, Campylobacter 
jejuni, and Chlamydia trachomatus, Chlamydia pneumoniae or any species falling within the genera of any of the above 
species. 

14. A purified or isolated nucleic acid consisting essentially of a nucleic acid encoding a polypeptide having 
10 a sequence selected from the group consisting of SEQ ID NOs.: 243-357, 359-398. 

15. A vector comprising a promoter operably linked to a nucleic acid encoding a polypeptide having a 
sequence selected from the group consisting of SEQ ID NOs.: 243-357, 359-398. 

1 6. A host cell containing the vector of Claim 15. 

17. A purified or isolated polypeptide comprising the sequence of one of SEQ ID NOs: 243-357, 359-398. 
15 18. A purified or isolated polypeptide comprising a fragment of one of the polypeptides of SEQ ID NOs. 

243-357, 359-398, said fragment selected from the group consisting of fragments comprising at least 5, at least 10, at 
least 20, at least 30, at least 40, at least 50, at least 60 or more than 60 consecutive amino acids of one of the 
polypeptides of SEQ ID NOs.: 243-357, 359-398. 

1 9. An antibody capable of specifically binding the polypeptide of Claim 1 7 or Claim 1 8. 
20 2Q - A method of producing a polypeptide, comprising introducing a vector comprising a promoter operably 

linked to a nucleic acid encoding a polypeptide having a sequence selected from the group consisting of SEO ID NOs. 243- 
357, 359-398into a. cell. 

21. The method of Claim 20, further comprising the step of isolating said protein. 

22. A method of inhibiting proliferation comprising inhibiting the activity or reducing the amount of a 
25 polypeptide having a sequence selected from the group consisting of SEQ ID NOs. 243-357, 359-398or inhibiting the 

activity or reducing the amount of a nucleic acid encoding said polypeptide. 

23. A method for identifying compounds which influence the activity of a polypeptide required for 
proliferation comprising: 

contacting a polypeptide having a sequence selected from the group consisting of 243-357, 359-398 
30 with a candidate compound; and 

determining whether said compound influences the activity of said polypeptide. 

24. The method of Claim 23, wherein said activity is an enzymatic activity. 

25. The method of Claim 23, wherein said activity is a carbon compound catabolism activity. 
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26. The method of Claim 23, wherein said activity is a biosynthetic activity. 

27. The method of Claim 23, wherein said activity is a transporter activity. 

28. The method of Claim 23, wherein said activity is a transcriptional activity. 

29. The method of Claim 23, wherein said activity is a DNA replication activity. 
5 N 30. The method of Claim 23, wherein said activity is a cell division activity. 

31. A method for assaying compounds for the ability to reduce the activity or level of a polypeptide 
required for proliferation, comprising: 

providing a target, wherein said target comprises the coding sequence of a sequence selected from the 
group consisting of SED ID NOs. 82-88, 90-242; 
10 contacting said target with a candidate compound; and 

measuring an activity of said target. 

32. The method of Claim 31, wherein said target is a messenger RNA molecule transcribed from a coding 
region of one of SEQ ID. NOs.: 82-88, 90-242 and said activity is translation of said messenger RNA. 

33. The method of Claim 32, wherein said target is a coding region of one of SEQ ID. NOs. 82-88, 90-242 
15 and said activity is transcription of said messenger RNA. 

34. A compound identified using the method of Claim 31. 

35. A method for identifying compounds which reduce the activity or level of a gene product required for 
cell proliferation comprising the steps of: 

expressing an antisense nucleic acid against a nucleic acid encoding said gene product in a cell to 
20 reduce the activity or amount of said gene product in said cell, thereby producing a sensitized celt- 

contacting said sensitized cell with a compound; and 

determining whether said compound inhibits the growth of said sensitized cell to a greater extent than 
said compound inhibits the growth of a nonsensitized cell. 

36. The method of Claim 35, wherein said cell is selected from the group consisting of bacterial cells, 
25 fungal cells, plant cells, and animal cells. 

37. The method of Claim 36, wherein said cell is an £ coli cell. 

38. The method of Claim 36, wherein said cell is from an organism selected from the group consisting of 
Staphylococcus aureus, Pseudomonas aeruginosa, Enterobacter cloacae, Helicobacter pylori, Neisseria gonorrhoeae, 
Enterococcus faecalis, Streptococcus pneumoniae, Haemophilus influenzae, Salmonella typhimurium, Saccharomyces 

30 cerevisiae, Candida albicans, Cryptococcus neoformans, Aspergillus fumigatus, Klebsiella pneumoniae, Salmonella typhi, 

Salmonella paratyphi, Salmonella cholerasuis, Staphylococcus epidermidis, Mycobacterium tuberculosis, Mycobacterium 
leprae, Treponema pallidum, Bacillus anthracis, Yersinia pestis, Clostridium botuiinum, Campylobacter Jejuni, and Chlamydia 
trachomatus, Chlamydia pneumoniae or any species falling within the genera of any of the above species. 
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39. The method of Claim 35, wherein said antisense nucleic acid is transcribed from an inducible promoter. 

40. The method of Claim 39, further comprising the step of contacting said cell with a concentration of 
inducer which induces said antisense nucleic acid to a sublethal level. 

41. The method of Claim 40, wherein said sub-lethal concentration of said inducer is such that growth 
5 inhibition is 8% or more. 

42. The method of Claim 40, wherein said inducer is isopropyl-l-thio-0-D-galactoside. 

43. The method of Claim 35, wherein growth inhibition is measured by monitoring optical density of a 
culture growth solution. 

44. The method of Claim 35, wherein said gene product is a polypeptide. 
10 45. The method of Claim 35, wherein said gene product is an RNA. 

46. The method of Claim 44, wherein said gene product comprises a polypeptide having a sequence 
selected from the group consisting of SEQ ID NOs.: 243-357, 359-398. 

47. A compound identified using the method of Claim 35. 

48. A method for inhibiting cellular proliferation comprising introducing a compound with activity against a 
15 gene corresponding to one of SEQ ID NOs.: 82-88, 90-242 or with activity against the product of said gene into a 

population of cells expressing a gene. 

49. The method of Claim 48, wherein said compound is an antisense oligonucleotide comprising a sequence 
selected from the group consisting of SEQ ID NOs.: 405485, or a proliferation-inhibiting portion thereof. 

50. The method of Claim 49, wherein said proliferation inhibiting portion of one of SEQ ID NOs. 405-485 is 
20 a fragment comprising at least 10, at least 20, at least 25, at least 30, at least 50 or more than 50 consecutive bases of 

one of SEQ ID NOs: 405-485. 

51. The method of Claim 48, wherein said compound is a triple helix oligonucleotide. 

52. A preparation comprising an effective concentration of an antisense oligonucleotide comprising a 
sequence selected from the group consisting of SEQ ID NOs.: 405-485, or a proliferation-inhibiting portion thereof in a 

25 pharmaceutical^ acceptable carrier. 

53. The preparation of Claim 52, wherein said proliferation-inhibiting portion of one of SEQ ID NOs. 405- 
485 comprises at least 10, at least 20, at least 25, at least 30, at least 50 or more than 50 consecutive bases of one of 
SEQ ID NOs: 405-485. 

54. A method for inhibiting the expression of a gene in an operon required for proliferation comprising 
30 contacting a cell in a cell population with an antisense nucleic acid, said cell expressing a gene corresponding to one of 

SEQ ID NOs.: 82-88, 90-242, wherein said antisense nucleic acid comprises at least a proliferation-inhibiting portion of 
said operon in an antisense orientation that is effective in inhibiting expression of said gene. 
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55. The method of Claim 54, wherein said antisense nucleic acid is complementary to a sequence of a gene 
comprising one or more of SEO ID NOs.: 82-88, 90-242. 

56. The method of Claim 54, wherein said antisense nucleic acid is a sequence of one of SEQ ID NOs.: 405- 
485, or a portion thereof. 

5 57. The method of Claim 54, wherein said cell is contacted with said antisense nucleic acid by introducing 

a piasmid which expresses said antisense nucleic acid into said cell population. 

58. The method of Claim 54, wherein said cell is contacted with said antisense nucleic acid by introducing 
a phage which expresses said antisense nucleic acid into said cell population. 

59. The method of Claim 54, wherein said cell is contacted with said antisense nucleic acid by introducing 
10 a sequence encoding said antisense nucleic acid into the chromosome of said cell into said cell population. 

60. The method of Claim 54, wherein said cell is contacted with said antisense nucleic acid by introducing 
a retron which expresses said antisense nucleic acid into said cell population. 

61. The method of Claim 54, wherein said cell is contacted with said antisense nucleic acid by introducing 
a ribozyme into said cell-population, wherein a binding portion of said ribozyme is complementary to said antisense 

15 oligonucleotide. 

62. The method of Claim 54, wherein said cell is contacted with said antisense nucleic acid by introducing 
a liposome comprising said antisense oligonucleotide into said cell. 

63. The method of Claim 54, wherein said cell is contacted with said antisense nucleic acid by 
electroporation. 

20 64 - The method of Claim 54, wherein said antisense nucleic acid is a fragment comprising at least 10, at 

least 20, at least 25, at least 30, at least 50 or more than 50 consecutive bases of one of SEQ ID NOs: 82-88, 90-242. 

65. The method of Claim 54 wherein said antisense nucleic acid is an oligonucleotide. 

66. A method for identifying bacterial strains comprising the steps of: 
providing a sample containing a bacterial species; and 

25 m identifying a bacterial species using a species specific probe having a sequence selected from the group 

consisting of SEQ ID NOs. 405-485, 82-88, 90-242. 

67. A method for identifying a gene in a microorganism required for proliferation comprising: 

(a) identifying an inhibitory nucleic acid which inhibits the activity of a gene or gene product 
required for proliferation in a first microorganism; 
30 . (b) contacting a second microorganism with said inhibitory nucleic acid; 

(c) determining whether said inhibitory nucleic acid from said first microorganism inhibits 
proliferation of said second microorganism; and 
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(d) identifying the gene in said second microorganism which is inhibited by said inhibitory nucleic 

acid. 

68. A method for assaying a compound for the ability to inhibit proliferation of a microorganism comprising: 
(a) identifying a gene or gene product required for proliferation in a first microorganism; 
5 (b) identifying a homolog of said gene or gene product in a second microorganism; 

(c) identifying an inhibitory nucleic acid sequence which inhibits the activity of said homolog in 
said second microorgansim; 

(d) contacting said second microorganism with a proliferation-inhibiting amount of said inhibitory 
nucleic acid, thus sensitizing said second microorganism; 

10 (e) contacting the sensitized microorganism of step (d) with a compound; and 

(f) determining whether said compound inhibits proliferation of said sensitized microorganism to 
a greater extent than said compound inhibits proliferation of a nonsensitized microorganism. 

69. The method of Claim 68, wherein said step of identifying a gene involved in proliferation in a first 
microorganism comprises: 

15 introducing a nucleic acid comprising a random genomic fragment from said first microorganism 

operably linked to a promoter wherein said random genomic fragment is in the antisense orientation; and 

comparing the proliferation of said first microorganism transcribing a first level of said random genomic 
fragment to the proliferation of said first microorganism transcribing a lower level of said random genomic 
fragment, wherein a difference in proliferation indicates that said random genomic fragment comprises a gene 

20 involved in proliferation. 

70. The method of Claim 69, wherein said step of identifying a homolog of said gene in a second 
microorganism comprises identifying a homologous nucleic acid or a nucleic acid encoding a homologous polypeptide in a 
database using an algorithm selected from the group consisting of BLASTN version 2.0 with the default parameters and 
FASTA version 3.0t78 algorithm with the default parameters. 

25 71. The method of Claim 69, wherein said step of identifying a homolog of said gene in a second 

microorganism comprises identifying a homologous nucleic acid or a nucleic acid encoding a homologous polypeptide by 
identifying nucleic acids which hybridize to said first gene. 

72. The method of Claim 69, wherein the step of identifying a homolog of said gene in a second 
microorganism comprises expressing a nucleic acid which inhibits the proliferation of said first microorganism in said 

30 second microorganism. 

73. The method of Claim 69, wherein said inhibitory nucleic acid is an antisense nucleic acid. 

74. The method of Claim 69, wherein said inhibitory nucleic acid comprises an antisense nucleic acid to a 
portion of said homolog. 
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75. The method of Claim 69, wherein said inhibitory nucleic acid comprises an antisense nucleic acid to a 
portion of the operon encoding said homolog. 

76. The method of Claim 69, wherein the step of contacting the second microorganism with a proliferation- 
inhibiting amount of said nucleic acid sequence comprises directly contacting said second microorganism with said nucleic 

5 acid. 

77. The method of Claim 69, wherein the step of contacting the second microorganism with a proliferation- 
inhibiting amount of said nucleic acid sequence comprises expressing an antisense nucleic acid to said homolog in said 
second microorganism. 

78. A compound identified using the method of Claim 68. 

10 79. A method of assaying a compound for the ability to inhibit proliferation comprising: 

(a) identifying an inhibitory nucleic acid sequence which inhibits the activity of a gene or gene 
product required for proliferation in a first microorgansim; 

(b) contacting a second microorganism with a proliferation-inhibiting amount of said inhibitory 
nucleic acid, thus sensitizing said second microorganism; 

1 5 -(c) contacting the proliferation-inhibited microorganism of step (b) with a compound; and 

(d) determining whether said compound inhibits proliferation of said sensitized second 
microorganism to a greater extent than said compound inhibits proliferation of a nonsensitized second microorganism. 

80. The method of Claim 79, wherein said inhibitory nucleic acid is an antisense nucleic acid which inhibits 
the proliferation of said first microorganism. 
20 81 - The method of Claim 79, wherein said inhibitory nucleic acid comprises a portion of an antisense 

nucleic acid which inhibits the proliferation of said first microorganism. 

82. The method of Claim 79, wherein said inhibitory nucleic acid comprises an antisense molecule against 
the entire coding region of the gene involved in proliferation of the first microorganism. 

83. The method of Claim 79, wherein said inhibitory nucleic acid comprises an antisense nucleic acid to a 
25 portion of the operon encoding the gene involved in proliferation of the first microorganism. 

84. A compound identified using the method of Claim 79. 

85. A method for assaying compounds for activity against a biological pathway required for proliferation 
comprising: 

sensitizing a cell by expressing an antisense nucleic acid against a nucleic acid encoding a gene product 
30 required for proliferation in a cell to reduce the activity or amount of said gene product- 

contacting the sensitized cell with a compound; and 

determining whether said compound inhibits the growth of said sensitized cell to a greater extent than 
said compound inhibits the growth of an nonsensitized cell. 
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86. The method of Claim 85 f wherein said cell is selected from the group consisting of bacterial cells, 
fungal cells, plant ceils, and animal cells. 

87. The method of Claim 86, wherein said cell is an £ coli cell. 

88. * The method of Claim 85, wherein said cell is from an organism selected from the group consisting of 
5 Staphylococcus aureus, Pseudomonas aeruginosa, Enterobacter cloacae, Helicobacter pylori, Aleisseria gonorrhoeae, 

Enterococcus faecalis, Streptococcus pneumoniae, Haemophilus influenzae. Salmonella typhimurium, Saccharomyces 
cerevisiae, Candida albicans, Cryptococcus neoiormans, Aspergillus fumigatus, Klebsiella pneumoniae. Salmonella typhi. 
Salmonella paratyphi, Salmonella cholerasuis, Staphylococcus epidermidis, Mycobacterium tuberculosis, Mycobacterium 
leprae, Treponema pallidum, Bacillus anthracis, Yersinia pestis, Clostridium botulinum, Campylobacter jejuni, and Chlamydia 
10 trachomatus, Chlamydia pneumoniae or any species falling within the genera of any of the above species. 

89. The method of Claim 85, wherein said antisense nucleic acid is transcribed from an inducible promoter. 

90. The method of Claim 89, further comprising contacting the cell with an agent which induces expression 
of said antisense nucleic acid from said inducible promoter, wherein said antisense nucleic acid is expressed at a sublethal 
level. 

15 91 ■ The method of Claim 90, wherein said sublethal level of said antisense nucleic acid inhibits proliferation 

by 8% or more. 

92. The method of Claim 90, wherein said agent is isopropyl-1-thio-P-D-galactoside (IPTG). 

93. The method of Claim 91, wherein inhibition of proliferation is measured by monitoring the optical 
density of a liquid culture. 

20 94 - The method of Claim 85, wherein said gene product comprises a polypeptide having a sequence 

selected from the group consisting of SEQ ID NOs: 243-357, 359-398. 

95. A compound identified using the method of Claim 85. 

96. A method for assaying a compound for the ability to inhibit cellular proliferation comprising: 
contacting a cell with an agent which reduces the activity or level of a gene product required for 

25 proliferation of said ceil; 

contacting said cell with said compound; and 

determining whether said compound reduces proliferation to a greater extent than said compound 
reduces proliferation of cells which have not been contacted with said agent. 

97. The method of Claim 96, wherein said agent which reduces the activity or level of a gene product 
30 required for proliferation of said cell comprises an antisense nucleic acid to a gene or operon required for proliferation. 

98. The method of Claim 96, wherein said agent which reduces the activity or level of a gene product 
required for proliferation of said cell comprises an antibiotic. 
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99. The method of Claim 96, wherein said cell contains a temperature sensitive mutation which reduces the 
activity or level of said gene product required for proliferation of said cell. 

100. The method of Claim 99, wherein said antisense nucleic acid is directed against the nucleic acid 
encoding the same functional domain of said gene product required for proliferation of said cell to which said antisense 

5 nucleic acid is directed. 

101. The method of Claim 99, wherein said antisense nucleic acid is directed against the nucleic acid a 
different functional domain of said gene product required for proliferation of said ceil than the functional domain to which 
said antisense nucleic acid is directed. 

1 02. A compound identified using the method of Claim 96. 

10 103. A method for identifying the pathway in which a proliferation-required nucleic acid or its gene product 

lies comprising: 

expressing a sublethal level of an antisense nucleic acid directed against said proliferation-required nucleic acid in 

a cell; 

contacting said cell with an antibiotic, wherein the biological pathway on which said antibiotic acts is known; 

15 and 

determining whether said cell has a substantially greater sensitivity to said antibiotic than a cell which does not 
express said sublethal level of said antisense nucleic acid. 

1 04. A method for determining the pathway on which a test compound acts comprising: 

(a) expressing a sublethal level of an antisense nucleic acid directed against a proliferation-required nucleic acid in 
20 a cell, wherein the biological pathway in which said proliferation-required nucleic acid lies is known, 

(b) contacting said cell with said test compound; and 

(c) determining whether said eel! has a substantially greater sensitivity to said test compound than a cell which 
does not express said sublethal level of said antisense nucleic acid. 

1 05. The method of Claim 1 04, further comprising: 

25 (dl expressing a sublethal level of a second antisense nucleic acid directed against a second 

proliferation-required nucleic acid in said cell, wherein said second proliferation-required nucleic acid is in a 
different biological pathway than said proliferation-required nucleic acid in step (a); and 

(e) determining whether said cell has a substantially greater sensitivity to said test compound than a 
cell which does not express said sublethal level of said second antisense nucleic acid. 

30 106. A purified or isolated nucleic acid consisting essentially of one of SEQ ID NOs: 358, 399-402. 

1 07. A compound identified using the method of Claim 23. 

108. A compound which interacts with the gene or gene product of a nucleic acid comprising a sequence of 
one of SEQ ID NOs: 82-88/90-242 to inhibit proliferation. 
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109. A compound which interacts with a polypeptide comprising one of SEQ ID NOs. 243-357, 359*398 to 
inhibit proliferation. 

110. A compound which interacts with a nucleic acid comprising one of SEQ ID NOs: 358, .399-402 to inhibit 
proliferation. 

5 
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SEQUENCE LISTING 



<110> ELITRA PHARMACEUTICALS, INC. 
Zyskind, Judith 
Ohlsen, Kari L. 
Trawick, John 
Forsyth, R. Allyn 
Froelich, Jamie M. 
Carr, Grant J. 
Yamamoto, Robert T. 
Xu, H. Howard 



<120> GENES IDENTIFIED AS REQUIRED FOR PROLIFERATION- IN 
ESCHERICHIA COLI 

<130> ELITRA. 001VPC 

<160> 485 

<170> FastSEQ for Windows Version 3.0 

, <210> 1 

<211> 159 
<212> DNA 
<213> E. Coli 

<400> 1 

caggtggtat ggaaacccaa aatggagacg ggaagctgaa ccagatagtt actggaggtg 
atcaccagca gatgaaataa cgataaccag aacaacgcct tatagcgttg agtttgcgag 
aaaacgttca tattgtacct t'tttgattaa ccattgggg 

<210> 2 

<211> 696 

<212> DNA 

<213> E. Coli 

<220> 

<221> misc_feature 
<222> (1) . . . (696) 
<223> n = A, T, C or G 



ggcctccagt tgcccgnccc aaacactttg gncccc 

<210> 3 
<211> 681 
<212> DNA 
<213> E. Coli 



<400> 2 



gattacatca agcgcgcggt 
aaagagctga cgattcaacc 
gtcacctact caaacgtgga 
ggggaagcga ccagcggatt 
cgtctttccg agcgtaaaga 
attgcgcagg atcaggtggg 
attgttcctc cgggacaata 
cgttactggg gctttgtgcc 
v^cttcgata accaagaagg 
cent taa tan cca cttcctt 
taatcncccg tggctnacaa 



gggtttaccg ggcgataaag tcacttacga tccggtctca 
gggatgcagt tccggccagg cgtgtgaaaa cgcgctgccg 
accgagegat ttcgttcaga ccttctcacg ccgtaatggt 
ctttgaagtg ccgaaaaacg aaaccaaaga aaatggaatt 
gacactgggt gatgtgacgc accgcattct gacagtgccg 
gatgtattac cagcagccag ggcaacaact ggcaacctgg 
cttcatgatg ggcgacaacc gcgacaacag cgcggacagc 
ngaagcgaat ctggtcggtc nggcaaegge tatctggatg 
cgaatggccg aatggtctgc cctaantcgc attggcgnnt 
cnctttgtcc ccttatggca acacttaatt tattntaaan 
atccccgcct tttnttaaaa atttccccna anttaaggtt 
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<220> 

<221> misc_f eature 
<222> (1) . . . (681) 
<223> n = A,T, C or G 
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<400 
ctgcagggta 
ccagtcggca 
acgtaagcgg 
gctttgtagc 
cttcgccagc 
cagccgcggt 
aaatatttcc 
ggtcaaagtt 
tacagcacac 
tggagatctt 
attgagatcg 
gaaagtatct 



> 3 
at gtcgccat 
gcgacaactt 
atggagtggc 
gtacgcgcgg 
ataaaagagg 
tgagtatcac 
tgatagcta t 
gatgtttttt 
gt agaaagca 
ttcttcactg 
ctatgaaata 
ggttaagaag 



taaactggcg 
gcgttaaagt 
cggaaacctc 
cat tggcaat 
aggatgctcg 
tgaataaagg 
cggtatcagg 
agtctgttgt 
cccgacaata 
gcctaaaaag 
tcaacaactt 
c 



caggcagcca 
cgcaaaatta 
atagtgaccg 
aagattcaga 
cgtatgcagc 
atcgttttcg 
aaccaggt ca 
caaagccgcn 
ctcctggcat 
ctgatattct 
ggaaaatctt 



aagagttgct 
tcatctgcac 
cccaccagt t 
tactcagact 
aactgctcca 
tcaatcaaat 
cgccatgcaa 
attataccng 
gggcgttaaa 
gtaaagagtt 
gnaaagcngg 



ccgctt ct ac 
tcactgcgtg 
ggcctgcat c 
cttccggggc 
gcgcaaattg 
gtggctgagc 
gtttcgtaat 
taaccggcac 
gctcacagga 
acacngtaac 
ttggaaaatg 



<210> 4 
. <211> 289 
<212> DNA 
<213> E. Coli 

<400> 4 

ggcagaattt tacgctgacc aatgacgcga cgacgtggca tggaaatact ccgttgttaa 

ttcaggattg tccaaaactc tacgagttta gtttgacatt taagttaaaa cgtttggcct 

tacttaacgg agaaccatta agccttagga cgcttcacgc catacttgga acgagcctgc 

ttacggtctt taacgccgga gcagtcaagc gcaccacgta cggtgtggta acgaacaccc 

gggaggtctt taacacgacc gtcacggatc aggatcacgg agtgctcct 

<210> 5 
<211> 815 
<212> DNA 
<213> E . Coli 

<220> 

<221> misc_feature 
<222> (1) . . . (815) 
<223> n = A, T, C or G . 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
681 



60 
120 
180 
240 
289 



<400 
gggagcttac 
aaatatgaag 
atatggcccc 
cgaggatttt 
tggcgcaggc 
ggcaatcgaa 
tgctgctgac 
taactgccag 
tctggacgca 
agaagaacgt 
tgcgctggaa 
ttgctgctaa 
gccaagccag 
cgattttntc 



> 5 
atcagtaagt 
ggggagagcc 
ctttttcact 
agaatggctg 
atgatggatt 
aacatgcgt a 
ggcgtgatca 
actgacttcg 
gctgttgctg 
gttgcgctgg 
ggcgacgtt c 
aagcgctgac 
aattcagaga 
agcatggtgg 



gaccgggatg 
cttatagacc 
tttatatctg 
aaat taccgc 
gcaaaaaagc 
agtccggtgc 
aaaccaaaat 
ttgcaaaaga 
gcaaaat cac 
tagcgaaaat 
tgggttctta 
gaagaactgg 
aactttccgc 
tcttcctncg 



agcgagcgaa 
aggtagtaca 
tgcggt ttaa 
atccctggta 
actgactgaa 
tattaaagca 
cgacggcaac 
cgctggtttc 
tgacgt t gaa 
tggtgaaaac 
tcagcacggt 
ttaaacacat 
ttcaccggag 
gagtt 



gataacgcat 
cgtttggtt.a 
tgccgggcag 
aaagagctgc 
gctaacggcg 
gcgaaaaaag 
tacggcatca 
caggcgttcg 
gttctgaaag 
atcaacattc 
gcgcgtatcg 
cgttttgacc 
gtcccaccca 



ctgcggcgcg 
gggggcctgc 
atcacatctc 
gtgagcgtac 
acatcgagct 
caggcaacgt 
ttctggaagt 
cagacaaagt 
cacagt t cga 
gccgcgttgc 
gccgttctgg 
tttgttgcaa 
cangganccc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
815 



<210> 6 
<211> 403 
<2 12 >" DNA 
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<400 
caacactatt 
ttaaaccatt 
cgatagaaac 
ccttacagcg 
gctgcaaccg 
ttttttcagt 
ttctaccagg 



> 6 
ttgttgaccg 
tcattgcgat 
aagcatt gaa 
caaaaaggct 
gaagggttgg 
gcttctgcgt 
tctttagctt 



gaaaat ggaa 
t t acacagaa 
aggcacagca 
ggt gactaaa 
cttatttaac 
cgtctttgct 
ctttcagacc 



cactttccgc 
cggacgtcct 
gt agtcaaac 
aagt caccag 
ttcaacttca 
cacgcct tct 
caggccagtt 



aatgcctgtt 
gtcgcagtat 
agtgtgaaac 
ccatcagcct 
gcgccagctt 
tt cagagcag 
gcg 



gctat cacgc 
attaagtcgt 
gctactggcg 
gatttctcag 
cttccagagc 
ccggtgcaga 



<210> 7 
<211> 149 
<212> DNA 
<213> E . Coli 

<400> 7 

gagctttttt cagtgcttct gcgtcgtctt tgctcacgcc ttctttcaga gcagccggtg 

cagattctac caggtcttta gcttctttca gacccaggcc agttgcgcca cgtactgctt 

tgataacagc aactttgtta gcgccagca 

<210> 8 
<211> 742 
<212> DNA 
. <213> E. Coli 

<220> 

<221> misc_f eature 
<222> (1) . . . (742) 
. <223> n - A, T, C or G 



<400 
ccatctgtcc 
acttt ccgca 
ggacgtcctg 
tagtcaaaca 
agtcaccagc 
tcaacttcag 
acgccttctt 
aggccagttg 
agaattacgt 
acagcagcag 
cgtccattac 
ttaaattgtt 
aagataactg 



> 8 
attgagcgga 
atgcctgttg 
tcgcagtata 
gtgtgaaacg 
catcagcctg 
cgccagcttc 
tcagagcagc 
cgccacgtac 
cgaat t cagt 
caagcggaaa 
agacatagct 
cctgaattat 
ngattaaagc 



cagtttgtgc 
ctatcacgct 
t'taagtcgtc 
ctactggcgc 
atttctcagg 
ttccagagct 
cggtgcagat 
tgctttgata 
tntttcttca 
caccgaat tt 
gcaactgctt 
caagaaataa 
ag 



aacactattt 
taaaccattt 
gatagaaaca 
cttacagcgc 
ctgcaaccgg 
tttttca.gtg 
tctaccaggt 
acagcaactt 
gcagcttcaa 
ttcttccatt 
caatgatttt 
gtnttatacg 



t gtt gaccgg 
cattgcgatt 
agcattgaaa 
aaaaaggctg 
aagggttggc 
cttctgcgtc 
ctttagcttc 
tgttagcgcc- 
ccgggccagc 
gcagagatca 
gatctttagt 
taagccgaaa 



aaaatggaac 
tacacagaac 
ggcacagcag 
gtgactaaaa 
ttatttaact 
gtctttgctc 
tttcagaccc 
agcagctttc 
agctacagct 
gttctacaac 
ggatagacat 
tgcgttaaaa 



<210> 9 
<211> 421 
<212>' DNA 
<213> E. Coli 



60 
120 
180 
240 
300 
360 
403 



60 
120 
149 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
.720 
742 



<400> 9 



agtagtcaaa 
aaagtcacca 
cttcaact tc 
tcacgccttc 
ccaggccagt 
tcagaattac 
ctacagcagc 
c 



cagtgtgaaa 
gccat cagcc 
agcgccagct 
tttcagagca 
tgcgccacgt 
gtcgaatt ca 
agcag.cggaa 



cgctactggc 
tgat t t ctca 
tcttccagag 
gccggt gcag 
actgctttga 
gttttttctt 
acaccgaatt 



gcct tacagc 
ggct gcaacc 
cttttttcag 
att ctaccag 
taacagcaac 
cagcagcttc 
tttcttccat 



gcaaaaaggc 
ggaagggttg 
tgcttctgcg 
gtctttagct 
tttgttagcg 
aaccgggcca 
tgcagagatc 



tggtgactaa 
gcttatttaa 
tcgtctttgc 
t ctttcagac 
ccagcagctt 
gcagctacag 
agttctacaa 



60 
120 
180 
240 
300 
360 
420 
4.21 
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<210> 10 
<211> 126 
<212>' DNA 
<213> E . Coli 

<400> 10 

agagc-tttt tcagtgcttc tgcgtcgtct ttgctcacgc cttctttcag agcagccggt 60 

gcagartcta ccaggtcttt agcttctttc agacccaggc cagrtgcgcc acgtactgct 120 

ttgaza 126 

<210> 11 

<211> 262 

<212> DNA 

<213> E. Coli 

<220> 

<221> misc_f eature 

<222> (1) . . . (262) 

<223> n = A, T, C or G 

<400> 11 

ctgcaaccgg aagggttggc ttatttaact tcaacttcag cgccagcttc ttccagagct 60 

tttttcagtg cttctgcgtc gtctttgctc acgccttctt tcagagcagc cgntgcagat 120 

tctaccaggt ctttagcttc tttcagaccc aggccagttg cgccacgtac tgctttgata 180 

acagcaactt tgttagcgcc agcagctttc agaattacgt cgaattcagt tttttcttca 240 

gcagcttcaa ccgggccagc ag 262 

<210> 12 

<211> 202 

<212> DNA 

<213> E. Coli 

<220> 

<221> misc_feature 
<222> (1) . . . (202) 
<223> n = A,T,C or G 

<400> 12 

gcgca-accc tgcagcatcg gcccgatgga gatcaggtcg gcagaacgct gtaccgcttt * 60 

gtaggzggtg ttac'cggtgn tcagatccgg gaagatgaac acggtagcgc gacctgcaac 120 

cggacagttc ggcgctttgg attncgcaac gtcagccatt accgcagcgt cgtactgcag 180 

cggaccggcg atcatcaggt ca 202 

<210> 13 
<211> 261 
<212> DNA 
<213> E . Coli 

<400> 13 

tctaggagta agaatagctt caaattcagc agttgacagt ggcataaacg taactggtga 60 

ctttrgcccg gcatgacgcc gggctttttt tattattccg tgacttccag cgtagtgaag 120 

gcaaacrtct cgccatcaaa tagcccctga ctggttagtt ttagcgcggg gatcactggc 180 

agngaaagaa acgccatctg aataaacggc tcatcgggta acggaccgca ttcacgggcg 240 

gcggczttca aggcgicaat t 261 

<210> 14 
<211> 224 
<212> DNA 
<213> E. Coli 

<400> 14 



eNSDOCID: <WO. 
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ttcttttttt cgtcaacggt gtccagaatc attttattta cctcggggta cttatgctga 

tttttattat tatggggaag gtgttattta tgagtttcat ttatgccgta acgacaatga 

actcgggaat tagtataagc agcgcgagaa taataatcat tgtgcaaatg ctaatttaat 

taatactatt taaatattat tttgagcata tgcacataag gt.tg 

<210>15 
<211> 232 
<212> DNA 
<213> E. Coli 

<400> 15 

aattcccttc tttttttcgt caacggtgtc cagaatcatt ttatttacct cgggtactta 

tgctgatttt tattattatg gggaaggtgt tatttatgag tttcatttat gccgtaacga 

caatgaactc gggaattagt ataagcagcg cgagaataat aatcattgtg caaatgctaa 

tttaattaat actatttaaa tattattttg. agcatatgca cataaggttg gg 

<210> 16 

<211> 212 

<212> DNA 

<213> E. Coli 

<400> 16 

aatagcgggt atgcacgcct ttcttttttt cgtcaacggt gtccagaatc attttattta 

cctcgggtac ttatgctgat ttttattatt atggggaagg tgttatttat gagtttcatt 

tatgccgtaa cgacaatgaa ctcgggaatt agtataagca gcgcgagaat aataatcatt 

gtgcaaatgc taatttaatt aatactattt aa 

<210> 17 
<211>433 
<212> DNA 
<213> E. Coli 



. 60 
120 
180 
224 



60 
120 
180 
232 



60 
120 
180 
212 



<400 
ccttgtaaat 
caggccataa 
agcggggcgg 
cacagcaaca 
atctttagcc 
gtcgcgacac 
gcgcacattc 
gtcgcgccca 



> 17 
tatcgcccgt 
atgccaccag 
agagctgttt 
tgatgcctct 
acgcgaccag 
aagcaacagg 
cggtaatcga 
ttg 



ggcataaaaa 
aattatcgtc 
cagttcggcg 
gtacaaccct 
gacaccgttg 
tagtgaatat 
acccactgaa 



ct gcgtccaa 
aaccaaccaa 
ggtaaccctt 
aacgtgccaa 
aaaaatcccg 
cctgcgttca 
acatcaatat 



acgccgtctt 
ttgctgaaac 
caatccattt 
gggtggcaac 
cgagcaaacc 
gtaacatccc 
tgcgcgt aag 



tgccagcagc 
gccaagcagc 
gccgccagt c 
aatggcaggg 
aagcagtaaa 
caacagcac'c 
cat t accagc 



60 
120 
180 
240 
300 
360 
420 
433 



<210> 18 
<211> 658 
<212> DNA 
<213> E. Coli 



<400 
cgtgcgcttc 
tcctt ccccg 
ctgcaacgcc 
gactgctctg 
cgcggaacag 
tgttaactgg 
ggacgcgata 
tggtgtcact 
acgggagcta 
aaaaggatgt 
ggagagccag 



> 18 
cggttgtggc 
ttgaggacac 
ctctgttarc 
ccgccctttt 
ttaaaaccaa 
ttaacgtcac 
atgaaaacgt 
atcagtaacc 
ttaaataaaa 
gcacaatgaa 
ttgatattgc 



aacccgcgaa 
cgggttgtca 
aattttctgg 
taaagt gaat 
aaacagtgtt 
ctggaggcac 
tat taccaaa 
cagtatttae 
tatgcattgt 
ttcagcattt 
agt caagtgg 



atggcgcggc 
ggttgaccat 
tgacgtttgg 
tttgtgatgt 
atgggtggat 
caggcactgc 
cgttaatacg 
tgaagatgcc 
ttcaatgctg 
gtgcttgttc 
tcacaggaca 



ggtaagtatg 
acgcttaagt 
cggtat cagt 
ggtgaa tgcg 
tctctgtatc 
atcacaaaat 
t ctgaaggt z 
attaacaaga 
gctcgtttac 
tgacagtttt 
atgcaggagt 



gcggggttat 
gacaaccccg 
tttactccgt 
gctgagcgca 
cggcgt taa t 
tcattgttga 
gttttgaaat 
gaaaacaaga 
gtctgatgcc 
tcttgtttcc 
gt atgact 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
658 



<210> 19 
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<211> 588 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_feature 
<222> • (1) . . . (588) 
<223> n = A, T, C or G 

<400> 19 

gtgactgctc tgccgccctt tttaaagtga attttgtgat gtggtgaatg cggctgagcg 60 

cacgcggaac agttaaaacc aaaaacagtg ttatgggtgg attctctgta tccggcgtta 120 

attgttaact ggttaacgtc acctggaggc accaggcact gcatcacaaa attcattgtt 180 

gaggacgcga taatgaaaac gttattacca aacgttaata cgtctgaagg ttgttttgaa 240 

attggtgtca ctatcagtaa cccagtattt actgaagatg ccattaacaa gagaaaacaa 300 

gaacgggagc tattaaataa aatatgcatt gtttcaatgc tggctcgttt acgtctgatg 360 

ccaaaaggat gtgcacaatg aattcagcat ttgtgcttgt tctgacagtt tttcttgttt 420 

ccggagagcc agttgatatt gcagtcagtg ttcacaggac aatgcangag tgtatgactg 480 

cagcaacccg aacagaaaat tcccggtaac tgttacccgg tcgataaagt tattcaccag 540 

gataatatcg aaatcccggc aggtctt taa aacagttccg taataaat 588 

<210> 20 

<211> 101 

<212> DNA 

<213> E. Coli 

<400> 20 

gatccagcaa gaagatgcgg ttgtaccgtc atcacgcaga tgcgcaaagc tactcagcaa 60 
ctgacctttc ttcgcaataa gcacgccatt agcgtcatag a 101 

<210> 21 
<211> 465 
<212> DNA 
<213> E . Coli 

<400> 21 

tcgcgtgttt accttcaaca tcggtaactt tctggcggat agtttcacgg taagcaacct 60 

gcggtttacc tacgttcgct tcaacgttga attcacgctt catacggtca acgatgatgt 120 

cgaggtgcag ttcgcccata cccgcgatga tggtctggtt agattcttcg .tcagtccata 180 

cacggaaaga cgggtcttct ttagccagac ggcccagagc cagacccatt ttttcctggt 240 

cagctttggt tttcggttca actgcgatgg agattaccgg ctcagggaat tccatacgtt 300 

ccagaatgat cggcgcatcc gggtcacaca gggtgtcacc agtggttacg tctttcagac 360 

cgatagcagc agcgatgtcg cccgcgcgaa cttctttgat ctcttcacgt ttgttagcgt 420 

gcatctgaac gatacgaccg aaacgctcac gtg.cagcttt cacgg 4 65 

<210> 22 
<211> 859 
<212> DNA 
<213> E . Coli 

<220> 

<221> misc_f eature 
<222> (1) . . . (859) 
<223> n = A, T, C or G 

<400> 22 - . 

tgatcggctc aagcagaact ggtttcgctt tcttaaagcc ttctttaaag gcgatagaag 60 
cagccagttt aaacgccagt tcagaggagt caacgtcatg gtaagaaccg aagtgcagac 120 
gaatacccat gtctactacc gggtagcctg ccagcggacc tgctttcagc tgttcctgga 180 
tacctttatc aacggccggg atgtattcgc cagggattac accaccttta atgtcgttga 240 
tgaactcgta gcctttcggg tttgaacccg gctccagcgg gtacatgtcg ataacaacat 300 
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gaccatactg accacgacca ccagactgtt tcgcgtgttt accttcaaca tcggtaactt 360 

tctggcggat agtttcacgg taagcaacct gcggtttacc tacgttcgct tcaacgttga 420 

attcacgctt catacggtca acgatgatgt cgaggtgcag ttcgcccata cccgcgatga 480 

tggtctggtt agattcttcg tcagtccata cacggaaaga cgggtcttct ttagccagac 540 

gggccanagc cagacccatt ttttcctggt cagctttggt tttcggtcaa ctgcgatgga 600 

gattaccggc tcanggaatt tccatacctt ccaggaatga tcggcgcatt ccggtcaaac 660 

anggngtacc aggggggtac ntntttttaa nancgattgc cagcancgga tntnncccgn 720 

gccnaacttc tttggaacnn tttaccggtt ggtaaccngc cttttnaacn atccaaccga 780 

aaaagngtta anngccantt ttccnggngt tnanntncgg nttcccngaa ntaacccncc 840 

cggggtnaac ccngnaaaa 859 



<210> 23 
<211> 269 
<212> DNA 
<213> E . Coli 



<400> 23 

ctttcttaaa gccttcttta aaggcgatag aagcagccag tttaaacgcc agttcagagg ' 60 

agtcaacgtc atggtaagaa ccgaagtgca gacgaatacc catgtctact accgggtagc 120 

ctgccagcgg acctgctttc agctgttcct g'gataccttt atcaacggcc gggatgtatt 180 

cgccagggat tacaccacct ttaatgtcgt tgatgaactc gtagcctttc gggtttgaac 240 

ccggctccag cgggtacatg tcgataaca 269 



<210> 24 
<211> 330 
<212> DNA 
<213> E . Coli 



<400> 24 

gttttgggga gatgtaaggg ctaatctgaa tggctgcatt ccttgtttaa ggaaaaacga 60 

atgactgatt gccgatacct gattaaacgg gfcatcaaaa tcatcattgc tgttttacag 120 

ctgatccttc tgttcttata .acacaaggaa acgtacttaa ggtgcgtccg gtgaaccagt 180 

cggacgcacc tttaataact ataaataagt gtctgggcag atactatata aattaactta 240 

gtgaatgatt atgctaatgt catcaattaa ataaatataa tggcgttaag gcttcccagt 300 

aatataatta atactctact tccagagtag 330 



<210> 25 
<211> 471 
<212> DNA 
<213> E . Coli 

<220> 

<221> misc_feature 
<222> {!)... (471) 
<223> n - A, T,C or G 



<400> 25 

gttttgggga gatgtaaggg ctaatctgaa tggctgcatt ccttgtttaa ggaaaaacga 60 

atgactgatt gccgatacct gattaaacgg gtcatcaaaa tcatcattgc tgttttacag 120 

ctgatccttc tgttcttata acacaaggaa acgtacttaa ggtgccgtcc ggtgaaccag 180 

tcggacgcac ctttaataac tataaataag tgtctgggca gatactatat aaattaactt 240 

agtgaatgat tatgctaatg tcatcaatta aataaatata atggcgttaa ggcttcccag 300 

tcritataatt aatactctac ttccagagta gaatattaaa ttttatccgc gtggtgcatc 360 

accacaaatt tatcccacaa ctgttcttct gtctcgacat gccccccgat ctttnacaaa 420 

tantattggg ggattnggcc cncctttttg ncaggttggg gtcntctnat g 471 



<210> 26 
<211> 379 
<212> DNA 
<213> E. Coli 
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<220> 

<221> misc_feature 

<222> (1) . . . (379) 

<223> n = A, T, C or G 

<4 00> 2 6 

natctgantg gctgcattcc ttgtttaagg aaacccgaat gactgattgc cgatacctga 60 

ttaaacgggt catcaaaatc atcattgctg ttttacagct gatccttctg ttcttataac 120 

acaaggaaac gtacttaagg -tgcgtccggt gaaccagtcg gacgcacctt taataactat 180 

aaataagtgt ctgggcagat actatataaa ttaacttagt gaatgattat gctaatgtca 240 

tcaattaaat aaatataatg gcgrtaaggc ttcccagtaa tataattaat actctacttc 300 

cagagtagaa tattaaattt tatccgcgtg gtgcatcagc acaaatttat cccacaactg 3.60 

ttcttcrgtc tcgacatgc " 379 

<210> 27 
<211> 799 
<212> DNA 
<213> E. Coli 

<400> 27 

aaagatgatg tgatgagaaa gtcaatttga ataagacaat attaagagct aaaaaaatgt 60 

caaaaaacac taaatcaaaa aataatggca ttagaaaata taatgcgaaa acggaggtga 120 

aattagttta tttcaaatga ggaaaatctc ccggcgaaaa aaccgggaga tgaaagtgtg 180 

atgggiatca aataaacaac agaggagaaa tttttaacgc agccattcaq gcaaatcgtt 240 

taatcccatt gcctggcgga taagttgcgg cttaacgcca ggaagcgtgt cggccagttt 300 

caaaccaata tcacgcagca gttttttcgc cggattggta ccggaaaaca gatcgcggaa 360 

tcccrgcata ccagccagca tcaacgccgc actgtgcttg cggctacgct cat'agcgacg 420 

cagataaatg tactgcccga tgtctgggat ccgtcgacct gcagccaagc ttgggctttt 480 

cagcctgata cagattaaat cagaacgcag aagcggtctg ataaaacaga atttgcctgg 540 

cggcagtagc gcggtggtcc cacctgaccc catgccgaac tcagaagtga aacgcccgta 600 

gcgcccga.tg gtagtgtggg gtctccccat gcgagagtag ggaactgcca ggcatcaaat 660 

aaaacgaaag gctcagtcga • aag'actgggc ctttcggttt atctggtggt tgtcggtgaa 720 

cgctctctga gtaggacaaa tccgccggga gcggattttg aacgttgcga aacaaccggc 780 

ccggaaaggg gtgggggct 799 

<210> 28 
<211> 636 
<212> DNA 
<213> E . Ccli 

<220> 

<221> misc_f eature 
<222> (1) . . . (636) 
<223> n = A, T , C or G 

<400> 28 

agggggtttg ttgtgggcaa tgatgcattt aagttatcgt ctgcagatag aggagatatt . 60 

acaataaaca acgaatcagg gcatttgata gtcaataccg caattctatc aggagatata 120 

gtcacictaa gaggaggaga aattaggttg gtattatagc ttgtgcgcgc catga'ttggc ' 180 

gcgcaattta aacttagrgc tttacatcgc tattgtcttg atttctttga attattttat 240 

aaattaaaaa aacgactgtt atgtataagc aaaggtcgaa cgaaaaatac attccaaata 300 

aatgcttgct taaatcicta tatccttccc cgaaaaatga cacataaaat tgagatattc 360 

caaaaagaga tactacaaat aaagatgcct ttattttatt atttctaata aaaatagaag 420 

caataaaaaa taataacaat gatataaatc taatgttttt aaatatattg tcttttatgt 480 

tagraatagt cgttagtatg tttgattctc catatattac gtgtagtttt ttatatacat 540 

ggaaataatt ntctt tatac tgagacatca caccatcatc aaatggaagt tcgaagatgg 600 

tgcttggttt gctaaccaat aaaaagagtg cattcg 636 

<210> 29 
<211> 757 
<212> DNA 
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<2i3> E . Coli 
<220> 

<221> misc_f ea ture 
<222> (1) . . . (757) 
<223> n = A, T, C or G 



PCT/USOO/02300 



<400 
cagcggtcgt 
gatgaaaaaa 
gctgggatgg 
gtgcaattt a 
ataacggcaa 
gtgctgat cc 
ccacctccat 
ataatccatg 
tcgctgataa 
ggtttttttt 
caaacatcat 
cacttcaagc 
ccggaaacgc 



> 29 
atttttagca 
acaacgatta 
tggtaacgtc 
agcgttaact 
ccacgaagct 
at ccgccgac 
ctttacctgg 
acagaatgcc 
aaagcatagc 
gcgatggcgc 
gttcgttttc 
cggcacttct 
cattgctggt 



tggtttttta 
tt atgatggg 
acctctaaaa 
tttaatcttc 
gccaaaattg 
t acggcaccg 
catgatccac 
cattgtttcc 
acaacat egg 
agaaattgeg 
cgtgtcataa 
ggcaccagca 
attggtgacn 



ttggcggcta 
tgtggcgatt 
aatagcaaag 
ctgtagataa 
aagecatega 
actatcccca 
ttcgccagaa 
tcacttatct 
gagggcaaga 
ccatcaacga 
gaaccgtacg 
aagtcategg 
gtcaegg 



tgctgccccg 
attgtegtae 
gctgcctgtg 
atagcacgac 
ctt taccaaa 
gcaggatagt 
taceggcaat 
gttttgeatt 
t ttgtgacga 
tpagtgataa 
ggattcacca 
cgt ctctggt 



ggagcataaa 
teggcactga 
tgcagccttt 
aatcgcacca 
gccaaacagc 
ca t aaagaat 
aagcccaaaa 
agegggttag 
geatcaegga 
tt accaacca 
gatcttttat 
tcataatcga 



<210> 30 

<211> 392 

<212> DNA 

<213> E . Coli 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
757 



<400 
aattacagaa 
acaaatatta 
agaccacat c 
acaccggcgc 
catggtagcg 
gcgcctgatg 
gctgcgccac 



> 30 
aaaggaggca 
ttgtgct gca 
gacctgatca 
ggcategget 
caegctatat 
aatcgcgtta 
gcccagcgac 



atategggta 
ggtgttttag 
tcaaactgaa 
t'tcatcatcc 
accggcccca 
t caategctg 
acagaacgaa 



aaggcattag 
cgggttgttg 
t ageggectg 
gcaccattgg 
gtttacgatg 
ccttacgcgc 
tt 



cccgacgaat 
atccacaggt 
ctegtaagt t 
gctgggctga 
aaagccgttc 
tttgtcttta 



aegteggget 
tctaactgga 
tectgggegg 
tagttggaaa 
gccagttcct 
taggcatccg 



<210> 31 
<211> 351 
<212> DNA 
<213> E. Coli 

<400> 31 
ctatcctt'ga tgaaaccgcg 
acagaaaaag gaggcaatat 
atattattgt getgeaggtg 
cacatcgacc tgatcatcaa 
cggcgcggca teggctttea 
gtagcgcacg ctatataccg 

<210> 32 

<211> 762 

<212> DNA 

<213> E . Coli 



agcaaagata 
egggtaaagg 
t tttagcggg 
actgaatagc 
tcarccgcac 
gccccagttt 



ggtgattacg tcatggtttt acagaaaatt 

cattagcccg aegaataegt egggctacaa 

ttgttgatcc acaggttcta actggaagac 

ggcctgctcg taagtttcct gggeggacac 

cattgggctg ggctgatagt tggaaacatg 

acgatgaaag ccgttcgcca g 



60 
120 
180 
240 
300 
360 
392 



60 
120 
180 
240 
300 
351 



<220> 

<221> misc_feature 
<222> (1) . . . (762) 
<223> n = A,T,C or G 

<400> 32 

aattatgaaa cactgtctgg aatcgtctga atgaegggea catttgegag cacgcatcca 



60 
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gtaataacac aggaaacWt tttatctacg cgttagcgat 

ggaggtaagc cgacgatrtc agcgggacgc tgaaacggga 

gccataaata aggaaagggt catgatgaag ctactcatca 

agcttccccg cttactaaga ctaccagggc gggggaaacc 

aaagtatgcc ttcacgataa gattgtcaat ccgcaggctt 

gcaaatattc tttgcgagtc gttacgcaat aatcacagag 

ttagcgatag actgcattca gggcgaaagg aggtaagccg 

aaacgggaaa gcctctcccg gagaagaggg cttttaataa 

acgtcatcat actggtgata ctcttaqtga ttagcttcca 

gggaggggga aacctcttcc taacectcac ttctgaaatt 

tactgcttan cgctaccagt ttgtctgccc tggcggttgt 

ttggatattt taatgaaagc cgacaaatca atcancgtga 

<210> 33 
<211> 293 
<212> DNA 
<213> E. Coli 



PCT/USOO/02200 



agactgcttg 

aagcccct cc 

tcgtggtgct 

ccgctctacc' 

tgtagtctgc 

gaaactattt 

atgatttcag 

ggaaagggt t 

ggcttactaa 

gggtgctatg 

aacgccagat 

eg 



catggcgaaa 
cgaggaaggg 
cttagtcata 
ctcactcctg 
gatcctgcca 
tattcacgeg 
egggaegctg 
atgatgaagc 
gaacaccagg 
acgctggcgt 
cggtacccgt- 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
762 



<400> 33 

geacatttge gagcaegcat ccagtaataa cacaggaaac tattttatct aegegttage 

gatagactgc ttgcatggcg aaaggaggta agecgacgat ttcagcggga cgctgaaacg 

ggaaagcccc tcccgaggaa ggggccataa ataaggaaag ggtcatgatg aagctactca 

tcatcgtggt gctcttagtc ataagcttcc ccgcttacta agactaccag ggcgggggaa 

accccgctct accctcactc ctgaaagtat gccttcacga taagattgtc aat 

<210> 34 
<211> 633 
<212> DNA 
<213> E. Coli 



60 
120 
180 
240 
293 



<220> 

<221> misc_f eature 
<222> (1) . . . (633) 
<223> n = A, T, C or G 



<400 
atttacactt 
tatggcagga 
ttaaaaataa 
cagccaccgc 
ttgetgaett 
agtacaacat 
ggacctatat 
cctacgacgg 
tgcagacacc 
gtatttcacg 
aagacgtagt 



> 34 
tttacgaaat 
aagatatgcg 
gatgttgctg 
gggttcgacc 
taagcatttc 
taagcagtgg 
gggtggcgcg 
tgagattttt 
ttgntatttt 
ttgattaatg 
aactggctgg 



catgggatca 
actgatatta 
ggtgcgcttt 
aatacctegg 
aaaccagggg 
cagttgcgt a 
tacgtgttga 
tatcatcget 
ttattattag 
cnggtggctc 
ttatcggaat 



ctaacaaaat 
cagat cccca 
tgctggttac 
gaatttctaa 
acaccgt acc 
acctgcccgc 
tcagcgacac 
aaaaaaagee 
ccactt gctc 
cagtgcgcca 
tgg 



atcgcttgtc 
aagtggagag 
cagtgccgcc 
gtatgagtta 
agaaatgtac 
gcctgatgcc 
egaeggtaaa 
ccctcatcat 
gtcttgcttg 
gattaacttt 



agttatattg 
t ttatgacca 
tgggccgcac 
agtagtttca 
cgtaccgatg 
gggaegcact 
atcattaaag 
gagggggaaa 
gtattaagtc 
gtttggatcg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
633 



<210> 35 
<211> 569 
<212> DNA 
<213> E. Coli 



<400 
tatggcagga 
ttaaaaataa 
cagccaccgc 
ttgetgaett 
agtacaacat 
ggacctatat 
cctacgacgg 



> 35 
aagatatgcg 
gatgttgctg 
gggttcgacc 
taagcatttc 
taagcagtgg 
gggtggcgcg 
tgagattttt 



actgatatta 
ggtgcgcttt 
aatacctegg 
aaaccagggg 
cagttgcgt a 
tacgtgttga 
tatcatcget 



cagat cccca 
tgctggttac. 
gaatttctaa 
acaccgtacc 
acctgcccgc 
tcagcgacac 
aaaaaaagee 



aagtggagag 
cagtgccgcc 
gtatgagtta 
agaaatgtac 
gcctgatgcc 
egaeggtaaa 
ccctcatcat 



tttatgacca 
tgggccgcac 
agtagtttca 
cgtaccgatg 
gggaegcact 
atcattaaag 
gagggggaaa 



60 
120 
180 
240 
300 
360 
420 
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tgcagacacc ttgttatttt ttattattag ccacttgctc gtcttgcttg ttattagtcg 480 
tatttcacgt tgattaatgc ggttgcctcc agtgcgccag atttaacttt gtttgtatcg 540 
tagacgtagt aactggctgg tatcggaat ~ 559 

<210> 36 

<211> 338 

<212> DNA 

<213> E . Coli 



<400> 36 

cgtattcaca tccttttgat tggtgataac atgcgaatcg gtattatttt tccggttgta 60 

atcttcatta cagcggtcgt atttttagca tggtttttta ttggcggcta tgctgccccg 120 

ggagcataaa gatgaaaaaa acaacgatta ttatgatggg tgtggcgatt attgtcgtac 180 

tcggcactgc ctgggatgqt ggtaacgtca cctctaaaaa atagcaaagg ctgcctgtgt 240 

gcagcctttg tgcaatttaa gcgttaactt ttaatcttcc tgtagataaa tagcacgaca 300 

atcgcaccaa taacggcaac cacgaagctg ccaaaatt 338 

<210> 37 
. <211> 375 
<212> DNA 
<213> E. Coli 



<400 
ctgaatatt t 
ataaaatatt 
caaagaagca 
ttttaattta 
atacataaat 
tccatctcta 
taaaacaact 



> 37 
aaaaaggaaa 
tccgtggagc 
ttgaat gcag 
acatgtaaac 
ggagtcatgt 
attgcat att 
gtaca 



acgacatgaa 
attttattat 
ggaaaaataa 
gca tggttaa 
tttccctttt 
ttaatttttc 



accgaagcac 
tgaatataga 
tatggccata 
tcctcatatc 
ccatttatca 
tgataaatgg 



agaat caaca 
ggt ttaactc 
aaaaacatcg 
acgggtggag 
agttcctgtt 
cattgagcat 



ttctccaatc 
cggtaaaaaa 
aaagaaactc 
tgttaagaac 
gccgttttag 
cgattt catt 



60 
120 
180 
240 
300 
360 
375 



<210> 38 
<211> 446 
<212> DNA 
<213> E. Coli 



<400 
t tacgatagc 
tgactacctt 
tatttaaaaa 
atatttccgt 
aagcattgaa 
atttaacatg 
taaatggagt 
ctctaattgc 



> 38 
tattagtaaa 
cgtttttttg 
ggaaaacgac 
ggagcatttt 
tgcagggaaa 
taaacgcatg 
catgttttcc 
atattttaat 



aatataagag 
attaagaat g 
atgaaaccga 
attattgaat 
aataatatgg 
gttaatcctc 
cttttccatt 
ttttct 



t t agctgtat 
attttattat 
agcacagaat 
atagaggttt 
ccataaaaaa 
atat cacggg 
tatcaagttc 



tgttatgtct 
cgtaagtaaa 
caacattctc 
aactccggta 
cat cgaaaga 
tggagtgtta 
ctgttgccgt 



gtggcgaaat 
attacatgaa 
caatcataaa 
aaaaacaaag 
aactctttta 
agaacataca 
tttagtccat 



60 
120 
180 
240 
300 
360 
420 
446 



<210> 39 
<211> 392 
<212> DNA 
<213> E. Coli 



<220> 

<221> misc_feature 
<222> (1) . . . (392) 
<223> n = A, T, C or G 

<400> 39 

tcaccccggt gccgattttc aggcatcctg atttaactta gcacccgcaa cttaactaca 60 

ggaaaacaaa gagataaatg tctaatcctg atgcaaatcg agccgatttt ttaatcttta 120 

cggactttta cccgcctggt ttattaattg cactgtnatc cgggcgttcg cccgctttaa 180 

tcacaatagg ctgtgtagcc tgggcctgtt tctctttcac ccgcgccaga gcggcagcaa 240 
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tcgcarcttt atctt tggct gcaggttgaa cggctgcgct * cttatgtcgr tcaaggcgag 300 
ccgctrtttc gcgcrccaga cgagcctggc gcgcttcgaa acgcgctttg gcttctgcgg 360 
•cncgcrtttc ttcctgacga atagccgcaa tt 392 

<210> 40 

<211> 208 

<212> DNA 

<213> E . Coli 

<400> 40 

taataacgct atctgcggat aaagcagaat aggtggttaa ccccagacat aaaccgagga 60 

aaataatgtt attgtatttc ataatctatt gttccttagc gacagattgc tgtctgctgg 120 

ttcagraagg taccaggaga aacttcagga agcttgtact cgacaataca gtttgagttt 180 

ttatctttgc cccatgaaac ctgtaatt 208 

<210> 41 
<211> 342 
<212> DNA 
<213> E . Coli 



<400> 41 

catcctcaat accgttaaat gcaacccgaa cccccgttgt ccctttgctg cattcactta 60 

acgtaatctg aaaagggacg gcrggacttg tgctaccggt cgtcggaaat tgtctggcac 120 

tgtttttttg gagatctacg gtaaaattaa gcgaatccga tgagactgtg cagccataat 180 

cgaggacgcg cccgctaatt ttaataacgc tatctgcgga taaagcagaa taggtggtta 240 

accccagaca taa.accgagg aaaataatgt tattgtattt cataatctat tgttccttag 300 

cgacagattg ctgtctgctg gttcagtaag gtaccaggag aa ^ 342 



<210> 42 
<211> 841 

<212> DNA , ... 

• <213> E. Coli 

<220> 

<221> raisc_feature 
<222> (1) . . . (841) 
<223> n = A, T, C or G 

* <400> 42 
agatitactg ccaatttccg gcagatcgga 
acgaatcacg gctataccgc caggcatggc 
gggcgctgat tcttcccacg cggttatttt 
aggarcgttg agcagcagat gatctaccag 
ctgaataccc gnnagaaaag gtgccacagc 
tggcaatngc aatcattttt tgcacttant 
ttntatgccg tcataacttt acgtatgtan 
gctncaacac gtaaatttna ttgnccccna 
ccccncnnan .ttttaaqntn ttnanagaat 
anttrtttng naataattcc tntatcnaan 
cggtutnntn nttnncnngg cnttttttta 
tagnctattt naagnggnct ttnttnnatt 
ttttatatan aattnrctta tntcntaatt 
gantcccnnt ccctntttaa tnnatgntct 
c 

<210> 43 
<211> 215 
<212> DNA 
<213> E . Coli 

<400> 43 



aagggttaaa ccatattgat ccataagggt 60 

ttgagccatg gcattaaatt ccgcaaattc 120 

ggcacacacc agatccagca aggggttntc 180 

ttncagcgcc tgggtgtatt gntccttgtt 240 

anttagcttn tctcctgctt gcaagatgtc 300 

acgatgnaca ncngtaaaga aatcgnattt 360 

cacrttttgc nattcnaaaa aagaccattn 420 

catttanaac ataaatgntt aaaattttcc 480 

ngggaattac ctgcttttna atgnactcan 540 

ctnnttttcn cccaanagnc nnccaaattn 600 

cccnanaann tttattcaan nccttttttg 660 

aactttccnn ttggncaaat tttggcnnat 720 

tnggnanccc cngatgnaan tttatggngg 780 

gggntatttt taaancctnn attaannnan 840 

8 41 
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aataactttt cgttaggcag ttttgggtgt gagttgcaag aggggagact actgaataac 60. 

tcaagtttta taatcgaggg gaaaatggtg atggcgttca tagcaaaacg ccctcaacca 120 

taaaggtcga gggcgcttaa gatgttaaaa acccgctatc cgttaaaaaa caatgttcaa 180 

ctaaggtcag tgacattgcg ctaaaaaagc gaatt 215 

<210> 44 
<211> 395 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_f eature 
<222> (1) . . . (395) 
<223> n = A, T, C or G 



<400 
gcattattca 
ttaaggttta 
gcattgctat 
taagggacag 
attgaatgtt 
aaattcatat 
atattgaaat 



> 44 
tgagaaatgt 
attatctgtg. 
aatattggtt 
gcatagagta 
gaccctatgt 
aatgcgcctt 
agtccaacta 



gtatcgtaaa 
tgtgatattt 
atcatttgct 
at gatacgta 
gttt at gaag 
tgctc.atgaa 
cttctttatt 



tcaactgaaa 
tattgaatgt 
gaatggattc 
tgcat aacca 
gagaggtatt 
tggatgccag 
accaa 



ttaacgcaac 
tttaaatatt 
agtcttaatg 
acatctttac 
ttcagttgat 
tatgtagtgg 



catttgttat 
gtttttattg 
agtgggtttt 
tcattatgtc 
ctggattgnt 
gaaattataa 



60 
120 
180 
240 
300 
360 
395 



<210> 45 
<211> 883 
<212> DNA 
<213> E . Coli 

<220> 

<221> misc_feature 
<222> (1 ) . . . (883)' 
<223> n « A, T, C or G 



<400> 
ataatcaggt 
cgcgtggcaa 
atgatatctt 
ctatacttat 
attgattatt 
aaattaatta 
gcgct cctgt 
gnnt gcncgn 
cgatngtcgg 
cgaaaggctn 
tcctgagnan 
agggngnngt 
ancngatggc 
cannntncaa 
anntnngggg 



45 

aagaaaaggt 

tacatcagtg 

ccgat ttatc' 

tcagcactca 

ttcctgattg 

atgtcatcag 

catg.tgcatt 

cgncccacct 

cgggngcctc 

attncaaaga 

gacaaatncc 

cntgacnccc 

ctttt tngcc 

tccngtnntn 

gnngngngcc 



gcgcggagat 
gcaa t aaaac 
ttaatcgttt 
caaataaagg 
ggctactggt 
gt ccgaaaat 
gcttcatata 
naccccatgc 
cccatgcnan 
ctgggcct tn 
gccgggagcg 
nnctctanct 
ntct accaaa 
cccatccnnn 
ggcggncccc 



taccgtgtgt 

gacatat cca ■ 

atggataacg 

aacgccaatg 

ggtgactggc 

aacgagaata 

at cactggcg 

cgaacttcag 

agtangggaa 

cntttatctg 

gatttgaacn 

nncngcctt c 

cnntttggt t 

tnntgaaagn 

cccccccccc 



tgcgatatat 
gaaaaatata 
gcaaagggct 
aaaattat ac 
gt atttaaga 
tttcagtctc 
caaggagcgc 
aantgaaaac 
ntgccangcg 
atgtttgtcg 
ttgcgaagca 
ttttgcttna 
aatgcttnta 
ntncct nccn 
ccc 



tttttagttt 
cactaagtga 
tcgttttttc 
tctgggctgt 
tgatatttta 
tcatcctgtt 
cgcaggcgna 
nccntaacnc 
ncnnattaaa 
gagaacgct c 
accgncccna 
angncctcct 
aaancct tt c 
tgtncantnt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
883 



<210> 46 
<211> 1024 
<212> DNA 
<213> E. Coli 

<220> 

<221> misc_f eature 
<222> (1) . . . (1024 ) 
<223> n = A, T, C or G 
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<400> 46 
gtttatggat aacggcaaag 
aaggaacgcc aatgaaaatt 
tggtggtgac tggcg.tattt 
aaataacgag aatat-ttcag 
tataatcact ggcgcaagga 
nccttcacna tncnnccncn 
nnncnnnnnn ncatnnnatc 
antncnaccn anntnacctt 
cccnttcnac nnccannnnn. 
ntcctanccn cnnacnnncc 
cnccnnttcc nnctnnncnc 
ccncncncnc ccnccatccc 
nnnnnncnnn tnnncncccn 
nntnaccnnt cctnncacnc 
tnnnnccata atcccacacc 
agctnnnnnt nctntnnnnc 
ccaccntccn cnncnnnntn 
tccc 



PCT/US00/02200 



ggcttcgt tt 
atactctggg 
aagatgatat 
tctctcatcc 
gcgcgcagag 
nantnnatag 
ccactnnntt 
a t acnnannc 
cancnnt cnn 
cancnatccn 
tnccncatnc 
nnnccnncnt 
gcncnnccnt 
tnccct nnng 
acncccntnc 
tnccncccnn 
nnncnancan 



tttcctatac 
ctgtattgat 
t ttaaaatta 
tgt tgcgctc 
tnctccnant 
nncaccnntn 
tnct ccannn 
nancnnnnnn 
ctnnnnccct 
nnnatacant 
ccnnnannan 
ccnnantnga 
nccntcacnc 
antccnacna 
ancntntnnt 
cnnactncnn 
ctnncacncn 



' 1 1 at tcagc.a 
tattttcctg 
attaatgtca 
ctgtcatgtg 
nnnnntnntt 
ttnntcnnnn 
nnncnnnntn 
nnccactctn 
nncntaattn 
cnattnntnn 
canntncccc 
caannnnaat 
tnnncnncta 
ntnnnnnanc 
ncntcntccc 
nnacccncnn 
cnantaacct 



ctcacaaata 
attgggctac 
tcaggtccga 
cattgcttca 
ntntnnctnn 
gnccncctcc 
canccnacaa 
nctcgnnctc 
tt ctnnctan 
cnntcncntn 
ncctncctna 
cncnnnnncn 
nanhnnntac 
nanaacnctn 
ttcntatcnc 
cccantcagt 
nntnncacct 



■ 60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1024 



<210> 47 
<211> 236 
<212> DNA 
<213> E. Coli 

<400> 47 

atatacacta agtgaatgat atcttccgat ttatcttaat cgtttatgga taacggcaaa 

gggcttcgtt ttttcctata cttattcagc actcacaaat aaaggaacgc caatgaaaat 

tatactctgg' gctgtattga ttattttcct gattgggcta ctggtggtga ctggcgtatt 

taagatgata ttttaaaatt aattaatgtc atcaggtccg aaaataacga gaatat 

<210> 48 .. . ■■ 

<211> 418 
<212> DNA 
<213> E . Coli 

<220> 
. <221> misc_f eature 
<222> (1) . . . (418) 
<223> n - A, T , C or G 



60 
120 
180 
236 



<400 
cggagattac 
ataaaacgac 
ntcgtttatg 
ataaaggaac 
tactggtggt 
cgaaaataac 
tcatataat c 



> 48 
cgtgtgttgc 
atatccagaa 
gataacggca 
gccaatgaaa 
gactggcgta 
gagaatat tt 
actggcgcaa 



gatatatttt 
aaatatacac 
aagggcttcg 
attat actct 
t ttaagatga 
cagtctct ca 
ggagcgcgca 



ttagtt tcgc 
taagtgaatg 
ttttttccta 
gggctgtatt 
tat tttaaaa 
t cctgt tgcg 
nggggcggcc 



gtggcaatac 
atatcttccg 
tacttatt.ca 
gattattttc 
ttaattaatg 
ctcctgtcat 
aatcgccgcc 



atcagtggca 
attnatctt a 
gcactcacaa 
ctgattgggc 
tcatcaggt c 
gtgcat tgct 
ggcccctg 



60 
120 
180 
240 
300 
360 
418 



■ <210> 49 
<211> 550 
<212> DNA 
<213> E . Coli 

<400> 49 

ctgctagtta cagggaacac taatgacaga cagctaaaag ccctgtttaa ttacgtatta 60 

caaacagggg atgcccagcg ttttcgtgca tttattggtg agatagcgga acgcgcacca 120 

caagaaaagg agaaacrgat gaccattgct gacagattac gtgaagaagg cgcaatgcag 180 

ggcaaacacg aagaagccct gcgtattgct caggagatgc tggatagagg tttagacaga 240 

gagttagtta tgatggtgac ccgactttca ccagacgatc ttatcgcgca aagccactaa 300 
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tcctgtaaca ccgggagtta actggcggat gtttgctgta aaccacatca gcgaacgaca 

tccgccagcg cctcttctaa atcgtaccag cgaaacgcaa aacccgcttc ttccagccgt 

ttaggcagcg cgcgttgtcc acctaatacc agtactgaag attcgcccat taacagt.cga 

atggcggtcg cggggacgcg caaaatggcc gggcgatgca gcgcatgacc gagcgcatgg 
gcaaattgtt 

<210> 50 
<211> 99 
<212> DNA 
<213> E. Coli 



360 
420 
480 
540 
550 



<400> 50 

ttggcatctc ggtgttgccg atcttcatga tatccagccc gccggaaact tcttcccaaa 60 
cggttt'tgct gttatccatt. gagtcacgga actgcccct 99 

<210> -51 
<211> 259 
<212> DNA 
<213> E. Cold 

<220> 

<221> misc_f eature 
<222> (1) . . . (259) 
<223> n = A, T, C or G 

<400> 51 

ccgtgccgag atgatcctgt naccatcatc cgttgtgaag tagtgattca cgacttcaag 60 

gcgcttttca aaagggtatt ttggctttga catattaggg gctattccat ttcatcgncc 120 

aacaaaatgg gtgcagtaca tactcnttgg aaatcaacac aggaggctgg gaatgccgca 180 

gaaatataga ttactttctt taatagtgat ntgtttcacg cttttatttt tnaaanaagt 240 

tnggcttact tcccgggnn 259 



<210> 52 
<211> 877 
<212> DNA 
<213> E. Coli 



<220> 

<221> misc_feature 
<222> (1) . . . (877) 
<223> n = A, T, C or G 



<400> 52 

cagcagagcg cggccttctt cgtcagattt cgcagtagtg gtaatggtaa tatccaaacc 60 

acgaacgcgg tcgactttat cgtagtcgat ttctgggaag atgatctgct cacggacacc 120 

catgctgtag ttaccacgac cgtcgaaaga cttagcggac aggccacgga agtcacggat 180 

acgaggtaca gcaatagtga tcaggcgctc aaagaactcc cacatgcgtt cgccacgcag 240 

agttacttta cagccgatcg gatagccctg acggattttg aagcctgcaa cagatttgcg 300 

tgctttggtg atcagcggtt tttgaccgga gattgctgcc aggtctgctg ctgcgttatc 360 

cagcagtttt ttgtcagcga tcgcttcacc aacacccatg ttcagggtga tcttctcgac 420 

ccgagggact tgcatgacag aattgtagtt aaactcagtc atgagttttt taactacttc 480 

gtctttgtag taatcatgca gtttcgccat cgtactactc catgtcggtg aacgctctcc . 540 

tgagtaggac aaatccgccg ggagcggatt tgaacgttgc gaagcaacgg cccggagggt 600 

ggcgggcagg acgcccgcca taaactgcca ggcatcaaat taagcagaag gccatcctga 660 

cggatggcct ttttgcgttt ctacaaactc ttttggttat ttttctaaat cattcaaata 720 

tgtatccgnt catcccatcc tatcgatgat aagctgtcaa acatgagaat ttaatcaatc 780 

taaagtttta tggngttaaa cttgggctgg cagnttncca atggcttaat cagtngaggg 840 - 

ccctatntta acgaactngg ctantttngg tcaatcn 877 



<210> 53 
<211> 291 



-IS- 
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<212> DNA 
<213> E . Coli 

<400> 53 

tgaacagcag agatacggcc agtgcggcca ..atgttttttg tcctttaaac ataacagagt 60 

ccttraagga tatagaatag gggtatagct acgccagaat aucgtatttg attattgcta 120 

gtttrtagtt * ttgcttaaaa atattgttag ttttattaaa tgcaa'aacta aattattggt 180 

atca.gaatt tgttgtatga tgaataaaat ataggggggt atagatagac gtcattttca 240 ' 

taggcttata aatgcgacta ccatgaagtt tttaattgaa agtattgggt t 291 

<210> 54 . - 

<211> 282 - • 

<212> DNA 

<213> E . Coli 

<4 00> 54 

ttatiaaatg caaaactaaa ttattggtat catgaatttg ttgtatgatg aataaaatat 60 

agggcggtat agatagacgt cattttcata gggttataaa tgcgactacc atgaagtttt 120 

. taatigaaag tattgggttg ctgataattt gagctgttct attct tttta aatatc'tata 180 

taggtctgtt aatggattt.t atttttacaa ttttttgtgt ttaggcatat aaaaatcaac 240 

ccgccatatg aacggcgggt taaaatattt acaacttagc aa 282 

<210> 55 
<211> 293 
<212> DNA 
<213> E . Coli 

<220> 

<221> misc_f eature 
<222> (1 ) . . . (293) 
<223> n = A, T, C or G 

<400> 55 

cggggtccgg cgctcatcaa caatcggggg gcagcaaggg gctgaaacgg gaaagcccct 60 

cccgaagaag gggccttgta taaggaaagg gttatgatga agctcgtcat catactggtt 120 

gtgtngttac tgttaagttt cccgacttac taacaactca tcagaggggg gagaaatcct 180 

cccttaccct tgttccttta ctctaggttg aaaaaacaac agcgtcaata ggcctgccat 240 

gtacgaagcg agatctgtga accgctttcc ggttagcctt ttttatcctg ttg 293 

<210> 56 
<211> 300 
<212> DNA 
<213> E. Coli 

<400> 56 

tctgcgttcc gctaaaaggt gcaaargctc aggacgttgc. agcgttttgc gtgaccgctc 60 

ggggaaggca aaattgcctc tgggaaagca ttgcgcgggg tccggcgctc atcaacaatc 120 

ggggggcagc aaggggctga aacgggaaag cccctcccga agaaggggcc ttgtataagg 180 

aaagggttat gatgaagctc gtcatcatac tggttgtgtt gttactgtta agtttcccga 240 

cttac:aaca actcatcaga ggggggagaa atcctccctt acccttgttc ctttactcta 300 

<210> 57 

<211> 359 

<212> DNA 

<213> E. Coli 

<400> 57 

caacacagga ggctgngaat gccgcagaaa tatagattac tttctttaat agtgatttgt 60 

ttcacgcttt tatttttcac ctggatgata agagattcac tgtgtgaatt gcatattaaa 120 

caggagagtt atgagcrggc ggcgttttta gcctgcaaat tgaaagagta agagtcttcg 180 

gcgggaaatt attcccgcct tacttacggc gttgcgcatt ctcattgcac ccaaatttat 240 
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tcttcacaaa aataataata gattttatta cgcgatcgat tatttatttc ctgaaaacaa 300 
ataaaaaaat ccccgccaaa tggcagggat cttagattct gtgcttttaa gcagagatt 359 

<210> 58' 
<211> 700 
. <212> DNA 

<213>~ E. Coli 

<220> 

<221> misc_f eature 
<222> (1) . . . (700) 
<223> n = A, T, C or G 



<400> 
aaaccttttt 
caccaaaacg 
cttcaaggcg 
atcgtccaac 
tgccgcagaa 
cctggatgat 
cggcgttttt 
ttacttacgg 
agattttatt 
atggcaggga 
gctgccgggc 
gattgtaacc 



58 

ctcctgtttt 
tgccgagatg 
cttttcaaaa 
aaaatgggt g 
atatagatta 
aagagattca 
agcctgcaaa 
cgttgcgcat 
acgcgatcga 
tcttagattc 
ctttaacgcc 
atcggtctgg 



tcatagaggg 
atcctgtaac 
gggtattttg 
cagtacatac 
ctttctttaa 
ctgtgtgaat 
ttgaaagagt 
tctcattgca 
ttatttattt 
tgtgctttta 
get ttcgatg 
atagecnaga 



caacccatgt 
catcatcagt 
gctttgacat 
tcgttggaaa 
tagtgatttg 
tgcatattaa 
aagagtcttc 
cccaaatt ta 
cctgaaaaca 
agcagagat t 
gtgaaggaca 
aatgtccaac 



cctgacctgg 
tgtgaagtag 
at taggggct 
tcaacacagg 
tttcacget t 
acaggagagt 
ggcgggaaat 
t tctt cacaa 
aataanaaaa 
acaggctggt 
ctttctgacc 



gttcggggga 
tgattcacga 
attccatttc 
aggctgggaa 
ttatttttca 
tatgagct gg 
tattcccgcc 
aaataataat 
tccccgccaa 
tacgt t acca 
ttegtccaga 



60 
12 0 
18 0 
240 
300 
360 
420 
480 
540 
600 
660 
700 



<210> 59 
<211> 631 
<212> DNA ' 
<213> E. Coli 

<220> 

<221> misc_feature 
<222> (1) . . . (631) 
<223> n = A, T, C or G 



<400 
tggtggcatt 
caccacgggg 
acaacacagg 
egget ccggt 
agtaacgtgt 
caggggggaa 
cgttcaggcg 
agaccggtgc 
tgaaaatctt 
gttgt t cgat 
caaeggggae 



> 59 
ggt tgctgga 
gctaatcttg 
eggctatatg 
cattgetgge 
catgegggeg 
act ctgegge 
aacgtacggt 
tgtt tgtggc 
gecatcgaga 
gaeaateggt 
gtcngtgata 



gagagaaaac 
actctagacc 
acgttcgcag 
attcttgeca 
tcaggctgcc 
etttttegtt 
ttatcctggt 
aacccactgg 
acgcgaacca 
gccccctgat 
gaeegggcta 



ccccgcacgt 
actcaagaat 
agctgggcat 
gtatgatcgt 
gtaatggcaa 
ettactgegg 
attgaataac 
tgagtttttt 
ccagat egga 
gcggtgcctt 

g 



tgcaggtat g 
agecgegaaa 
ggcett ctgg 
gaactggctg 
tttgcgcccg 
gtaaggcacc 
tactgeattt 
ccagt caaca 
gatagecagg 
catgecgaag 



cacct 
cgttg 
catga 
aacaa 
gacca 
cagtc 
gagtt 
ttgtc 
aagct 
aattt 



gacaa 
tcatt 
tttag 
gcgga 
ggccg 
geege 
ctegg 
ttegg 
gctcg 
caccc 



. 60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
631 



<210> 60 

.<211> 648 

<212> DNA 

<213> E . Coli 

<220> 

<221> misc^feature 
<222> (1) . . ..(648) 
<223> n = A, T, C or G 

<400> 60 
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ggctcaggcn 
atggagagaa 
.ttcaaaaaat 
gctgttgctg 
gtctggccca 
tgccaaccag 
ctgttgtctc 
tttagnctga 
gctctngtct 
natnngtaan 
atnntnnnan 
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tgctgattgt 
tcataaacgt 
t t gagggata 
cttgagggtc 
gcggaccagc 
cgcccaggag 
cagccattca 
tncanat tan 
ggtagttagc 
ttatntttnt 
nnnncagnnc 



ttttttgtgc 
ggtgaatgat 
aggccggaat 
gttttaacca 
atat t aggat 
cgcagactta 
cgacgactgg 
attnataaac 
nncggnnngr 
nngtctnant 
ntntttttta 



aatggcccng 
gattgttagc 
ggct ccggcc 
gacgccaggc 
ggcgaatcgt 
gcagcatatt 
cggaagggric 
gcagnanncn 
ctcnttntna 
tntanttgng 
aatnntttnt 



tattagcgtc 
aaggaaaact 
agagggaagt 
gctccatacg 
ccagatcgcc 
ccancgacga 
cgcgnctgac 
ggtntgatta 
cccnnttcnn 
tactntaagt 
nanncnnc 



gttgctgtcg 
gtcaaaaatc 
taaccgcgaa 
ccaaaaccgc 
atcacgctac 
t cgtaagcgc 
caacttgnct 
atcntatttn 
tttannttac 
ntatncgnnn 



<210> 61 
<211> 737 
<212>- DNA . 
<213> E. Coli 

<220> 

<221> misc_f eature . 
<222> (1) . . . (737) 
<223> n = A, T, C or G 



<400 
tgctaatat c 
aaacggaggc 
ttattttctg 
ataatccctc 
acggcgaaaa 
ttgcttttac 
cagttttctc 
ataaacatgc 
ccaatgacca 
atgttgatat 
acacctgcac 
cccccgtagc 
gggtgccgta 



> 61 
tttctcattg 
aaataatgct 
gttttctggc 
acctaaccgg 
aaaaccgcca 
gctaacaggc 
aatgaatgtt 
agcgatttct 
catccagagg- 
gcccttcaga 
gacatgcttc 
gataaaccgt 
atcttac 



agatgaaaat 
gggtaatatg 
cgcgtatttc 
ccccttgtta 
gtaaaccggc 
attttcctgc 
aaacggagct 
tccggtttgc' 
ctcttcagga 
atgtgtgatg 
ttcatcatta 
ttcgtaatna 



taaggtaagc 
aatgt-tttta 
agccacaaat 
cagttgtgta 
ggtgaatgct 
actgataacg 
t aaactcggt 
ttaccctcat 
aatgcgcgac 
gcatggttat 
gccgct ttga 
cgcgtccaac 



gaggaaacac 
tggccgtact 
gggatgacta 
caaggggcct 
tgcatggata 
aat cgttgac 
taatcacatt 
acatt gcccg 
tcacacctgc 
cgactaactg 
caataatgat 
tgggntaagt 



accacacca t 
gggaataatt. 
a tgaacggag 
gatttttatg 
gatttgtgtt 
acagtagcat 
ttgttcgtca 
gtccgctct t 
tgtcacggta 
gcaaattct g 
aaattcttcg 
aaagtt gcca 



<210> 62 

<211> 648 

<212> DNA 

<213> E . Coli 

<220> 

<221> misc__feature 
<222> (1) ... (648) 
<223> n = A, T, C or G 



<400 
tgcttttgaa 
aaaaccgata 
aactctttcg 
cgagtggctc 
ctctcaactg 
agaccaccga 
tccgatatcc 
gcctccgggt 
tggaagttcg 
cttaaccccc 
aaagccnact 



> 62 
tatgtgctcg. 
atctgaaaga 
gattcaccca 
gt tagcaaaa 
taaggggatt 
agccccggag 
at aaaagaag 
tgtaaatacc 
ctgatgtccg 
cgtgggccca 
gnnaattggt 



caatcttgag 
acccaagtat 
gaaagtgaan 
agttt caaca . 
gaaatggtaa 
gtggt gaaat 
tcgggtct tt 
gact t t gctg 
ccctttttaa' 
gttaaaagtc 
tacctggt tt 



aaggaaatgg 

ttcagtataa 

ccaaaa tgat 

a tggagtaaa. 

ccccagctct 

aaaaccgggc 

gtctggtaaa 

ggtgtagcct 

agggaatttt 

at ggtaagnc 

gcaagtancc 



cgaccacgaa 
gcattgaatg 
aatcgtatac 
tacatccaac 
t cgct tgagg 
acaacacgaa 
attaaattgg 
ggcggcatca 
ggtgatgccg 
ctaatnggtt 
ctggaagg 



agaaaaggca 
ccgaccagta 
ataagtcttt 
at atcaataa 
ggtatagccg 
agggcgcat t 
tgggaagtgc 
agtttttttc 
gtgaatgccg 
tggggtggga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
648 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
737 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
648 



<210> 63 
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<211> 237 

<212> DNA ■ • 

<213> E. Coli 

<220> 

<221> misc__feature 
<222> (1) . . . (237) 
<223> n = A, T, C or G 

<400> 63 

ggtgtttant tacaagagat tcatctttgt ntaaancccn gataagtaat tacgcataaa 

acaacaatga ttataatagc aaaaataaat attatcatct ttgatagatt acttgagata 

gccagcatct tgtaaagcct ttatcgtttt tttatgctct ggattaatat aatcactaca 

tctatctgag ' caatctgttg ttgatggaca tgtcaaccca tggtcattta cagccaa 

<210> 64 

<211> 427 

<212> DNA 

<213> E. Coli 



60 
120 
180 
237 



<400 
gataattaga 
aatcatgcca 
cttcaaaatg 
aggaatgccg 
tagccttgcc 
tggcgcacaa 
ggcggtggta 
gctgatg 



> 64 
gtttgtcgtc 
ttagtcatrg 
aacggcttca 
ctggataaag 
ctgatcatgg 
cgtatcgcca 
ctgaccggtt 



agaaa 
ttgct 
tcgct 
ttatt 
gtttt 
ccacg 
tt acc 



att ga 
a t egg 
ctegt 
ggct c 
ggege 
ctgat 
gttgg 



cgttacccat 
tgtaatcttg 
cctcgtggcg 
cat caaagee 
aatgctgggc 
tgccaaattt 
ttttgccctg 



aacaaatgaa 
ttgttgctcc 
cttgctgttg 
ggtgtcggcg 
aaaatgctgg 
ggtaaaaaac 
ttctatgaag 



aggecaggta 
tgatgatccg 
gattaatgea 
ggaegctegg 
cagactgegg 
acatccagtg 
tgggctttgt 



60 
120 
180 
240 
300 
360 
420 
4 27 



<210>-65 
<211> 261 
<212> DNA 
<213> E. Coli 



<220> 

<221> misc_f eature 
<222> (1). . . . (261) 
<223> n. = A, T, C or G 

<400> 65 

caaagaacct tcaacatgaa aaatatccat ttgtttgcaa aaaaagatta ttaggaagga 
aattaatgea attatcgaaa attcaaaaaa tatccaaaaa tngtatactt tattccagaa 
gagttcaata taatgtttgt cttcaatttt tcttacttca gggtaatata gattgetcat 
tacattgtga gcttcatctt tatttaattt tctgttgact ccagctctcc gtgataaegg 
ttttataatt agatgettat c 

<210> 66 

<211> 98 J 

<212> DNA 

<213> E . Coli 



60 
120 
180 
24 0 
261 



• <400> 66 

agatgattgc egggaacttg ttageggcac geaggeggeg gctcgcaccc ttaccctgct 
etttaegtae ttctgcgttg atagtaaaca tttctttc 

<210> 67 
<211> 260 
<212> DNA 
<213> E . Coli 



60 
98 
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' <400> 67 

aagcgcgaac gaagtcgatg tgctgcagct tcggtttgta cgggtgacgc tgtacgtcct 60 

gagctrtaac ttrgatttct ttaccgtcaa caacgatggt cagaacttcg ctgtagaatt 120 

cagctrtagc ttgcatgttc atgactttgt cgtgatccag ctcgatagcc agcggcgctt 180 

ctttgccacc gtagatgatt gccgggaact tgttagcggc acgcaggcgg cggctcgcac 240 

ccttaccctg ctctttacgt 260 



<210> 68 
•<211> 95 
<212> DNA 
<213> E . Coli • 

<4O0>.68 

aaaaacggcg 'taaagaaagg ttgcaaacat gttaataaaa actcaaattg atcccacgta 60' 
tatatracgc cgcaaaatcc ttacaataaa caggg 95 

<210> 69 
<211> 174 
<212> DNA 
<213>' E. Coli 

<400> 69' 

ttaatiatta aaatagtgta acgcgattat gtggttatgg gggtaaacat taaataaacc 60 

agcggggagg ggaggtaaag tgaaaaaata aaaagcggat aatcttaata agcaggccgg 120 

acagcatcgc catccggcac tgatacgagg tttatttcag ctcatcaacc atcg 174 

<210> 70 

<211> 138 

<212> DNA 

. <213> E . Coli 

<400> 70 ■' 

agtctgtaaa aacgtcaaaa agagtgtttt atcaacagaa gaatggaggt ctgacagata 60. 

gtagtaatgc aaaaaaatgg agacttaagt tgaatgaacg ggagtaaagc gaaaagacta 120 
tagagtgaag gagaaatt 138 

<21Q> 71 

<211> 191 

<212> DNA 

<213> Z . Coli 

<400> 71 

-tttgttggct taatattcta ttgttatctt tatttataga tgtttatatt gcatgaggtg . 60 

gttttrggag agaagaatga ggaagatgcg tcgagccaca gaaacgtta.g ctttacatat 120 

agcg.gaggtg atgtgaattt aatttacaat agaaataatt tacatatcaa acagttagat 180 

gctttttgtc g 191 

<210> 72 
<211> 244 
<212> DNA 
<213> E. Coli 

<400> 72 

ggccaittat acaggaaaag cctatgtcag aacgtaaaaa ctcaaaatca cgccgtaatt 60 

atctcgttaa atgttcctgc ccaaactgca cccaagagtc agaacacagt ttttcaagag 120 

tacaaaaagg tgcccttttg atctgccctc attgcaacaa agtattccag acaaatctta 180 

aagctgtagc ctgattgatt ttattagtaa caagtatttt ttatatttta ataatatatt 240 

taaa 244 



<210> 73 
<211> 327 
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60 
120 
180 
240 
300 
327 



WO 00/44906 

<212> DNA 
<213> E . Coli 

<220> 

<221> misc_f eature 
<222> (1) . . . (327) 
<223> n = A,T,C or G 

<400> 73 

aaattttcag gtaccttgtc accatacttt tttttctgag cattaatgat attttgagct 

tcttgaggat ctttaactcc. ccacatttgg tggaaagtat tcatattaaa aggaaggntg 

aataatttgn ctttataaat cgccagtgga gaattagtaa aacgattaaa ttctactaaa 

tnattaaccg naaaaaaatt cccatatata tttatcattg gtatgaaaaa tatgtgcacc 

atatttatga atntggatac cctnacagtc ctctgtgtac gcatttccac cgatatgatt 
tcttttctna atcactaaaa ctt-tttt 

<210> 74 

<211> 150 

<212> DNA 

<213> E. Coli 

<400> 74 

gcagtgatcg aagcgatgac gaagrgtatg gaaaaatcag aaaaactcag caaatcctga 60 

tgactttcgc cggacgtcag gccgccactt cggtgcggtt a'cgtccggct ttctttgctt 120 

tgtaaagcgc caaatctgcc gatttcaacc 150 

<210> 75 

<211> 330 

<212> DNA 

<213> E . Coli 

<400> 75 

gaaagtatct tcgttattga catcactgga aaatataact tgcttttcat tattaaactc 60. 

gaagcgcgta ccgtatctgg acaaacattt atcgagctta ccaaattcct gaagaggttt 120 

aactacagat aacatttgcg cgtcctttgc agtaatgccc gtcaaatcct tgacgggcat 180 

tatttagatt a'aattaccag tatttcttcg gagtgaagaa tattaccagg tatatttaac 240 

acccacgttc gcggaccagt cttgatctac gtcaccacca ccgaggtagt tagcatcggt 300 

ataggcgctg aagttcttgg " tgaagctaaa 330 

<210> 76 s 

<211> 194 

<212> DNA 

<213> E. Coli " 

<400> 76 

tgtttttttc cagcaacgga gcaaaaggtt tgcccttgtg cagctcaggg ttaaccactt 60 

taacracgtg gcgacgaccc ggagatgtcg gtttacattt aacaactgcc attgtattac 120 

tcctccgact tactcagcgc cgccaacgaa gtccagattc tggccttctt tcagggtgac 180 

gtaagctttt ttcc 194 

<210> 77 

<211> 188 

<212> DNA 

<213> E . Coli 



<400> 77 

tccctttaac taccagggtg ttaacgactt cgacttcgac ttcaaacagt ttctgcacag 60 

cagctttgat ttctgctttg gtcgcgtctt tagcaacttt gagtactatg gtgttggatt 120 

tttccatcgc agtagacgct ttttcagaaa cgtgcggtgc acgcagcacc ttcagcagac 180 

gttcttca ~ 188 
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<210> 78 
<211> 173 
<212> DNA 
" <213> E. Coli 

<400> 78 

acaaaggcga acaaagcctg tgaagcccga aggctccaca gacagtgcta cttgaaggcc 60 

ttactgtttc ttcttaggag cgagcaccat gatcatctgg cggccttcga tcttggttgg 120 

gaaggattcg accactgcca gttcttgcaa atcgtctttc acgcgattaa gca * 173 

<210> 79 

<211> 272 

<212> DNA 

<213> E. Coli 



<220> 

<221> misc_f eature 
<222> (1) . . . (272) 
<223> n = A, T, C or G 



<400> 79 

tggagaaaac gggtgattga taaagcaatc atcgttctag gggcgttaat tgcgctgctg 60 

gaactgatcc cgctttctgc ttcaagcttc tgaactggat acggaaacgt aatnagggct 120 

aaagaagaca ctactcttag ccctttaaca tttaacgcat tgtcacgaac tcttctgccg 180 

ccgttgggtg aatggcgacg ggtattggtc gaaatctttt ttgggtggcc ccatctttaa 240 

cgcccacccg cgaaaccctg caacatttcg tc 272 



<210> 80 
<211> 259 • 
<212> DNA 
<213> E. Coli 



<400> 80 

cgcaggcagc tgatggtcaa caggatgaga gaaacccaga gacaggttaa tcacattgcc 60 

tttaaccgct gcacggtaac ctacaccaac cagctgcagc ttcttagtga agccttcggt 120 

aacaccgata accattgagt tcagcagggc acgcgcggta ccagcctgtg cccaaccgtc 180 

tgcgtaac'ca tcacgcggac c.gaaggtcag ggtattatct gcatgtttaa cttcaacagc 240 

atcgttgaga gtacgagtc 1 259 



<210> 81 
<211> 73 
<212> DNA 
<213> E . Coli 



<400> 81 

caggtcggaa cttacccgac aaggaatttc gctaccttag gaccgttata gttacggccg 60 
ccgtttaccg ggg 73 

<210> 82 
<211> 666 
<212> DNA 
<213> E. Coli 



<400> 82 

atgaacgttt tctcgcaaac tcaacgctat aaggcgttgt tctggttatc gttatttcat 60 

ctgctggtga tcacctccag taactatctg gttcagcttc ccgtctccat tttgggtttc 120 

cataccacct ggggcgcgtt tagctttccg tttatttttc ttgctaccga cctgaccgtg 180 

cgtatttttg gcgcaccgct ggcccgacgc attatcttcg cggtaatgat ccctgcgtta 240 

ttaatctcct acgtcatctc gtcgctattc tatatgggtt cctggcaggg attcggcgca 300 

ctcgcccact tcaacctgtt tgtcgcccgt atcgccaccg ccagtttcat ggcctacgcg 360 

ctggggcaaa tcctcgacgt gcacgttttt aaccgcctgc gtcagagtcg ccgctggtgg 420 
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ctggcaccga cagcgtccac actgttcggt aacgtcagcg acacgctggc ctttttcttc 
attgccttct ggcgtagccc ggatgccttt atggctgaac actggatgga aatcgcgctg 
gtcgattact gtttcaaagt gttaatcagt atcgttttct tcctgccaat gtatggcgta 
ttactcaata tgctgttgaa aagactggca gataaatccg aaatcaacgc tttgcaggcg 
agttaa 

<210> 83 
<211> 612 
<212> DNA 
<213> E . Coli 



480 
540 
600 
660 
666 



<400 
gtgataaga t 
aat ctggtta 
aaagacacta 
gtcaacctgc 
ggtaagctgg 
gcagtcatcg 
ctggaaga tc 
gagctgaaag 
gcgtactctt 
ctgcccgct g 
gttattcagt 



> 83 
ggatgaatga 
aatatgt egg 
acgctacggc 
ttgatggcaa 
gaaegcagge 
tcatcatggg 
ageaaegtag 
gt cacaaaat 
ccgttattct 
acgatcagca 
aa 



<210> 84 

<211> 975 

<212> DNA 

<213> E. Coli 



gccgttatgg 
aattggcctg 
gcagggttcg 
gttaagtttc 
caataacatg 
cgatgatccg 
ccgcgatccg 
gcagcagtta 
gggtaacgtg 
aaaagcgcag 



ccgtttatcg 
ctggttatgg 
gtcgcggaaa 
tcgctgccag 
catgtctggt 
aaagaagatc 
cagctgeaag 
gacagtatta 
ggtaatcaac 
accaccgcag 



aaaggaagaa 
ggcttgegge 
gtaaegctae 
eggatatgae 
ccgacgccac 
tggcggtgct 
tggtaaccaa 
tctccgcgaa 
tgctgaccat 
aaaacatcat 



gtcaatgege 
ctgtgatgat 
cgggaatccc 
cgaccagagc 
egggcagaaa 
ggcgaagcgt 
taaagecatt 
aggecagacg 
gcaaattacg 
taatacgctg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
612 



<400> 84 

atggcgaata tgtttgccct gattctggtg attgccacac tggtgacggg 

tgcgtggata aattcttttt c'gcacctaaa eggegggaac gtcaggcagc 

getgeegggg actcactgga taaagcaacg ttgaaaaagg ttgcgccgaa 

ctggaaaccg gtgcttctgt' ttttccggta ctggctatcg tattgattgt 

atttatgaac cgttccagat cccgtcaggt tegatgatge cgactctgtt 

tttattctgg t agagaagtt - t gcttatggc attaaagatc ctatctacca 

atcgaaaccg gtcatccgaa aegeggegat atcgtggtct ttaaatatcc 

aagcttgatt acatcaagcg cgcggtgggt ttaccgggcg ataaagtcac 

gtctcaaaag agctgacgat teaaceggga tgcagttccg gecaggegtg 

ctgccggtca cctactcaaa cgtggaaccg agegattteg ttcagacctt 

aat'ggtgggg aagcgaccag eggattcttt gaagtgccga aaaacgaaac 

ggaattcgtc tttccgagcg taaagagaca ctgggtgatg tgacgcaccg 

gtgccgattg egcaggatea ggtggggatg tattaccagc agecagggea 

acctggattg ttcctccggg acaatacttc atgatgggcg acaaccgcga 

gaeagcegtt actggggctt tgtgccggaa gcgaatctgg teggteggge 

tggatgagct tcgataagca agaaggegaa tggccgactg gtctgegett 

ggcggcatcc attaa 



cattttatgg 
ggcgcaggcg 
gcctggctgg 
gcgttcgttt 
aattggtgat 
gaaaacgctg 
ggaagatcca 
ttacgatccg 
tgaaaacgcg 
ctcacgccgt 
caaagaaaat 
cattctgaca 
acaactggca 
caacagcgcg 
aaeggctate 
aagtegcatt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
975 



<210> 85 

<211> 1761 • 

<212> DNA 

<213> E . Coli 



<400 
t tgaccatta 
atatttgetc 
eggctgeaat 
gcgaaggccc 
caaagegatg 
ttacgt cacg 
gactgggtag 



> 85 
egaaacttge 
agecacattt 
ttgcgctgga 
eggaagagt c 
caggccaact 
cagtgagtgc 
aageggagea 



atggcgtgac 
gattgacgaa 
gcagttgctg 
tgagt atctg 
ggtgggcggt 
agatgataat 
actctttggc 



ctggttcctg 
aacgatcctt 
catacgegag 
aatcttattg 
cactatgagg 
tt tgegaett 
tgcctgcgcc 



ataccgatag 
tattcagtga 
catcctcctc 
ccaat geege 
ttt ccggcca 
taacgcaagt 
agtttaatgg 



ctatcaggaa 
tactcaaccg 
ttttatgctg 
gegtaegcta 
ct ccatccgc 
tgtcgctgcc 
cgacattacc 



60 
120 
180 
240 
300 
360 
420 
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ctgcagccig 
ctgctggcgc 
gactgggttg 
aagctgaaag 
ccagagcttt 
gcggagtcag 
cctgcaccgg 
gaacaagaaa 
tccctgtgtg 
gaatg'gcgcg 
ctgattgaaa 
ccgggtcatc 
gatggtgaat 
gggatgatga 
ttctcagcat 
atggctgaac 
gctatcacag 
aaaat cgaag 
attatcccca 
gaagaaggca 
ttaaatctgg 
atcgcgcaag 
aactggttta 



gtctggtgca 
aacctctgct 
cgtttgatga 
tcattctggt 
cagagcaggc 
taacccagtg 
gagcggatgc 
cactt ccgct 
atggcgacac 
aaggtttcct 
ccgaaggcga 
cacgcgcttt 
tcaccgacat 
tcatgcaagc 
cgctgacatt 
tctgcgccct 
gttcagtcga 
gcttctttgc 
cagctaacgt 
aat tcaccat 
tgtgggatgg 
catcgcaaca 
ttccgaactg 



t caggcaaat 
gtggatgcgg 
gtcgcgccct 
aggcgaacgc 
ta tt tatagc 
gtgtcgctgg 
ctggccgata 
tagcccgcag 
cttct ccggc 
cgctgaacgt 
acgcat cggg 
tggcgaacct 
cgaacgcaaa 
gttcctgatg 
tgagcagtca 
gataagcgcc 
tcagtt cggt 
tat ttgccag 
tcgccattta 
ctgggcagta 
cgaaggccaa 
ggaaggacgt 
a 



ggcggtattc 
ctgaaaaata 
ctccccgtct 
gaatcattgg 
gaatttgaag 
gtgacattta 
cttatccgcg 
tggatcctcc 
gagcagctaa 
atgcaggatg 
caaattaacg 
tctcgcatta 
gcggagcttg 
t cggaactac 
tacagtgaag 
ctcgccgatg 
cgcgcccagc 
caacgtgagt 
agtcttcaca 
gacgatgtga 
acgacgctga 
caccgttttc 



tcattatctc 
tcgttaaccg 
ctgtgccttc 
ctgatttcca 
atactctgca 
ccgccagaca 
aagcagcacg 
gccagtgtaa 
acttaatgct 
agatccttca 
ccctttcggt 
gctgcgttgt 
gcggcaatat 
agct tgagca 
ttgatggaga 
tgccggtgaa 
cggtcggtgg 
taaccgggaa 
gtgaactggt 
ctgacgcact 
tgcaaaccat 
catggccat t 



tt tgcgtaca 
cgagcgttt t 
gatgccattg 
ggagatggag 
gattgtcgat 
taatcactta 
ctacaccggt 
agaggtcgcc 
gcagcagcgt 
ggagcaaatc 
cattgaattt 
gcatattggc 
ccatgcgaaa 
acagat cccc 
tagtgcctcg 
t cagagtat c 
tttaaatgag 
acaaggtgtc 
gaaagcggta 
gccgttatta 
ccaggaacgt 
acgttggctg 



480 
540 
. 600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1761 



' <210> 86 
<211> 1185 
<212> DNA 
<213> E . Coli 

<400> 86 

g.tgtctaaag aaaaatttga acgtacaaaa ccgcacgtta acgttggtac tatcggccac 60 

gttgaccacg gtaaaactac tctgaccgct gcaatcacca ccgtactggc taaaacctac 120 

ggcggtgctg ctcgtgcatt cgaccagatc gataacgcgc cggaagaaaa agctcgtggt 180 

atcaccatca acacttctca cgttgaatac gacaccccga cccgtcacta cgcacacgta 240 

gactgccegg ggcacgccga ctatgttaaa aacatgatca ccggtgctgc tcagatggac 300 

ggcgcgatcc tggtagttgc tgcgactgac ggcccgatgc cgcagactcg tgagcacatc . 360 

ctgctgggtc gtcaggtagg cgttccgtac atcatcgtgt tcctgaacaa atgcgacatg 420 

gttgatgacg aagagctgct ggaactggtt gaaatggaag ttcgtgaact tctgtctcag 480 

tacgacttcc cgggcgacga cactccgatc gttcgtggtt ctgctctgaa agcgctggaa 540 

ggcgacgcag agtgggaagc gaaaatcctg gaactggctg gcttcctgga ttcttatatt 600 

ccggaaccag agcgtgcgat tgacaagccg ttcctgctgc cgatcgaaga cgtattctcc 660 

atctccggtc gtggtaccgt tgttaccggt cgtgtagaac gcggtatcat caaagttggt 720 

gaagaagttg aaatcgttgg tatcaaagag actcagaagt ctacctgtac tggcgttgaa 780 

atgttccgca aactgctgga cgaaggccgt gctggtgaga acgtaggtgt tctgctgcgt 840 

ggtatcaaac gtgaagaaat cgaacgtggt caggtactgg ct'aagccggg caccatcaag 900 

ccgcacacca agttcgaatc tgaagtgtac attctgtcca aagatgaagg cggccgtcat 960 

actccgttct tcaaaggcta ccgtccgcag ttctacttcc gtactactga cgtgactggt 1020 

accatcgaac tgccggaagg cgtagagatg gtaatgccgg gcgacaacat caaaatggtt 1080 

gttaccctga tccacccgat cgcgatggac gacggtctgc gtttcgcaat ccgtgaaggc 1140 

ggccgtaccg ttggcgcggg cgttgttgct aaagttctgg gctaa 1185 

<210> 87 
<211> 2115 
<212> DNA 
<213> E . Coli 



<400> 87 
atggctcgta caacacccat 
gccggtaaaa ccactactac 
ggtgaagttc atgacggcgc 
attaccatca cttccgctgc 



cgcacgctac cgtaacatcg gtatcagtgc gcacatcgac 60 

cgaacgtatt ctgttctaca ccggtgtaaa ccataaaatc 120 

tgcaaccatg gactggatgg agcaggagca ggaacgtggt 180 

gactactgca ttctggtctg gtatggctaa gcagtatgag 240 
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ccgcatcgca tcaacatcat cgacaccccg gggcacgttg acttcacaat cgaagtagaa 300 

cgttccatgc gtgttctcga tggtgcggta atggtttact gcgcagttgg tggtgttcag 360 

ccgcagtctg aaaccgtatg gcgtcaggca aacaaatata aagttccgcg cattgcgttc 420 

gttaacaaaa tggaccgcat gggtgcgaac ttcctgaaag ttgttaacca gatcaaaacc 480 

cgtctgggcg cgaacccggt tccgctgcag ctggcgattg gtgctgaaga acatttcacc 540 

ggtgttgttg acctggtgaa aatgaaagct atcaactgga acgacgctga ccagggcgta 600 

accttcgaat acgaagatat cccggcagac atggttgaac tggctaacga atggcaccag 660 

aacctgatcg aatccgcagc tgaagcttct gaagagctga tggaaaaata cctgggtggt 720 

gaagaactga ctgaagcaga aatcaaaggt gctctgcgtc agcgcgttct ga'acaacgaa 780 

atcatcctgg taacctgtgg ttctgcgttc aagaacaaag gtgttcaggc gatgctggat 840 

gcggtaattg attacctgcc atccccggtt gacgtacctg cgatcaacgg tatcctggac 900 

gacggtaaag acactccggc tgaacgtcac gcaagtgatg acgagccgtt ctctgcactg 960 

gcgttcaaaa tcgctaccga cccgtttgtt ggtaacctga ccttcttccg tgtttactcc 1020 

ggtgtggtta actctggtga- taccgtactg aactccgtga aagctgcacg tgagcgtttc 1080 

ggtcgtatcg ttcagatgca cgctaacaaa cgtgaagaga tcaaagaagt tcgcgcgggc 1140 

gacatcgctg ctgctatcgg tctgaaagac gtaaccactg gtgacaccct gtgtgacccg 1200 

gatgcgccga tcattctgga acgtatggaa ttccctgagc cggtaatctc catcgcagtt 1260 

gaaccgaaaa ccaaagctga ccaggaaaaa atgggtctgg ctotgggccg tctggctaaa 1320 

gaagacccgt ctttccgtgt atggactgac gaagaatcta accagaccat catcgcgggt 1380 

atgggcgaac tgcacctcga catcatcgtt gaccgtatga agcgtgaatt caacgttgaa 1440 

gcgaacgtag gtaaaccgca ggttgcttac cgtgaaacta tccgccagaa agttaccgat 1500 

gttgaaggta aacacgcgaa acagtctggt ggtcgtggtc agtatggtca tgttgttatc 1560 

gacatgtacc cgctggagcc gggttcaaac ccgaaaggct acgagttcat caacgacatt 1620 

aaaggtggtg taatccctgg cgaatacatc ccggccgttg ataaaggtat ccaggaacag 1680 

ctgaaagcag gtccgctggc aggctacccg gtagtagaca tgggtattcg tctgcacttc 1740 

ggttcttacc atgacgttga ctcctctgaa ctggcgttta aactggctgc ttctatcgcc 1800 

tttaaagaag gctttaagaa agcgaaacca gttctgcttg agccgatcat gaaggttgaa 1860 

gtagaaactc cggaagagaa caccggtgac gttatcggtg acttgagccg tcgtcgtggt 1920 

atgctcaaag gtcaggaatc tgaagttact ggcgttaaga tccacgctga agtaccgctg 1980 

tctgaaatgt tcggatacgc aactcagctg cgttctctga ccaaaggtcg tgcatcatac 2040 

actatggaat tcctgaagta tgatgaagcg ccgagtaacg ttgctcaggc. • cgtaattgaa 2100 

gcccgtggta aataa --Y 2115 

<210> 88 
<211> 540 
<212> DNA 
<213> E. Coli 



<400> 88 

atgccacgtc gtcgcgtcat tggtcagcgt aaaattctgc cggatccgaa gttcggatca 60 

gaactgctgg ctaaatttgt" aaatatcctg atggtagatg gtaaaaaatc tactgctgaa 120 

tctatcgtat acagcgcgct ggagaccctg gctcagcgct ctggtaaatc tgaactggaa 180 

gcattcgaag tagctctcga aaacgtgcgc ccgactgtag aagttaagtc tcgccgcgtt » 240 

ggtggttcta cttatcaggt accagttgaa gtccgtccgg ttcgtcgtaa tgctctggca 300 

atgcgttgga tcgttgaagc tgctcgtaaa' cgcggtgata aatccatggc tctgcgcctg 360 

gcgaacgaac tttctgatgc tgcagaaaac aaaggtactg cagttaagaa acgtgaagac 420 

gttcaccgta tggccgaagc caacaaggcg ttcgcacact accgttggtt atcccttcgg 480 

agttttagtc accaggcggg cgcttccagt aagcagcccg ctttgggcta cttaaattga 540 



<210> 89 

,<211> 1549 

<212> DNA 

<213> E. Coli 

<400> 89 

aaattgaaga gtttgatcat ggctcagatt 

gtcgaacggt aacaggaagc agcttgctgc 

tgtctgggaa ' gctgcctgat ggagggggat 

aatgtcgcaa gaccaaagag ggggaccttc 

ggattagctt gttggtgggg taacggctca 

ggatgaccag ccacactgga acrgagacac 



gaacgctggc ggcaggccta 

ttcgctgacg agtggcggac 

aactactgga aacggtagct 

gggcctcttg ccatcggatg 

ccaaggcgac gatccctagc 

ggtccagact cctacgggag 



acacatgcaa 60 

gggtgagtaa 120 

aataccgcat 180 

tgcccagatg 240 

tggtctgaga 300 

gcagcagtgg 360 
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ggaatattgc 
tcgggttgta 
gacgttaccc 
ggtgcaagcg 
agatgtgaaa 
cgtagagggg 
cggtggcgaa 
aaacaggatt 
ccttgaggcg 
aaggttaaaa 
ttcgatgcaa 
gaatgtgcct 
aaatgttggg 
ccgggaactc 
catcatggcc 
gacctcgcga 
ctcgactcca 
ttcccgggcc 
tagct taacc 
acaaggtaac 



acaatgggcg 
aagtactttc 
gcagaagaag 
ttaatcggaa 
tccccgggct 
ggtagaattc 
ggcggccccc 
agat accctg 
tggcttccgg 
ctcaaatgaa 
cgcgaagaac 
tcgggaaccg 
ttaagtcccg 
aaaggagact 
cttacgacca 
gagcaagcgg 
tgaagt cgga 
ttgtacacac 
ttcgggaggg 
cgtaggggaa 



caagcctgat 
agcggggagg 
caccggctaa 
ttactgggcg 
caacctggga 
caggtgtagc 
tggacgaaga 
gtagtccacg 
agctaacgcg 
ttgacggggg 
cttacctggt 
tgagacaggt 
caacgagcgc 
gccagtgata 
gggctacaca 
acctcataaa 
atcgctagta 
cgcccgtcac 
cgcttaccac 
cctgcggttg 



gcagccatgc 
aagggagtaa 
ctccgtgcca 
taaagcgcac 
actgcatctg 
ggtgaaatgc 
ctgacgctca 
ccgtaaacga 
tt aagt cgac 
cccgcacaag 
cttgacatcc 
gctgcatggc 
aacccttatc 
aactggagga 
cgtgctacaa 
gtgcgtcgta 
atcgtggatc 
accat gggag 
tttgtgattc 
gatcacctcc 



cgcgtgtatg 
agttaatacc 
gcagccgcgg 
gcaggcgggt 
atactggcaa 
gtagagatct 
ggtgcgaaag 
tgtcgacttg 
cgcctgggga 
cggtggagca 
acggaagttt 
tgtcgtcagc 
ctttgttgcc 
aggtggggat 
tggcgcatac 
gtccggattg 
agaatgccac 
tgggttgcaa 
atgactgggg 
ttaccttaa 



aagaaggcct 
tttgctcatt 
taatacggag 
tggttaagtc 
gcttgagtct 
ggaggaatac 
cgtggggagc 
gaggttgtgc 
gtacggccgc 
tgtggt'ttaa 
tcagagatga 
tcgtgttgtg 
agcggtccgg 
gacgtcaagt 
aaagagaagc 
gagtct gcaa 
ggtgaatacg 
aagaagtagg 
tgaagtcgta 



420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1549 



<210> 90 
<211> 375 
<212> DNA 
<213> E . Coli 



<400 
atggcaacag 
gtgcctgcgc 
actcctaaaa 
ttcgaagtga 
ctgatccgtg 
gcgcttgact 
cgtcctaagg 



> 90 
tt aaccagct 
tggaagcatg 
aaccgaactc 
cttcctacat 
gcggtcgtgt 
gctccggcgt 
cttaa 



ggtacgcaaa 
cccgcaaaaa 
cgcgctgcgt 
cggtggtgaa 
t:aaagacctc 
taaagaccg.t 



ccacgtgctc 
cgtggcgtat 
aaagtatgcc 
ggtcacaacc 
ccgggtgttc 
aagcaggctc 



gcaaa'gttgc 
gtactcgtgt 
gtgttcgtct 
tgcaggagca 
gttaccacac 
"gttccaagta 



gaaaagcaac 
atatactacc 
gactaacggt 
ctccgtgatc 
cgtacgtggt 
tggcgtgaag 



<210> 91 
<211> 366 
<212> DNA 
<213> E . Coli 

<400> 91 
atgtctatca ctaaagatca' 
gtagaactga tctctgcaat 
gtagctgctg gcccggttga 
gctgctggcg ctaacaaagt 
ctgaaagaag ctaaagacct 
aaagacgacg cagaagcact 
aaataa 

<210> 92 
<211> 498 
<212> DNA 
<213> E . Coli 



aat cattgaa 
ggaagaaaaa 
agctgctgaa 
tgctgttat c 
ggtagaatct 
gaaaaaagct 



gcagttgcag 
ttcggtgttt 
gaaaaaactg 
aaagcagtac 
gcaccggctg 
ctggaagaag 



ctatgtctgt 
ccgctgctgc 
aat tcgacgt 
gtggcgcaac 
ct ctgaaaga 
ctggcgctga 



aatggacgtt 
tgctgtagct 
aattctgaaa 
tggcctgggt 
aggcgtgagc 
agttgaagtt 



60 
120 
180 
240 
300 
360 
375 



60 
120 
180 
240 
300 
360 
366 



<400> 92 



atggctttaa 
ggcgcgctgt 
ctgcgtaaag 
cgccgtgctg 
ctgattgcat 
aaagcgaatg 



atcttcaaga 
ctgcagtagt 
caggtcgcga 
ttgaaggtac 
act ctatgga 
caaaatttga 



caaacaagcg 
tgcggattcc 
agctggcgta 
t ccgt t cgag 
acacccgggc 
ggtcaaagcc 



attgttgctg 
cgtggcgtaa 
tacatgcgtg 
tgcctgaaag 
gctgctgct c 
gctgcct ttg 



aagtcagcga 
ctgtagataa 
ttgttcgtaa 
acgcgtttgt 
gtctgttcaa 
aaggtgagct 



agtagccaaa 
aatgactgaa 
caccctgctg 
tggtccgacc 
agagttcgcg 
gatcccggcg 



60 
120 
180 
240 
300 
360 
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tctcagatcg accgcctggc aactctgccg acctacgaag aagcaattgc acgcctgatg 420 

gcaaccatga aagaagcttc ggctggcaaa ctggttcgta ctctggctgc tgtacgcgat 480 

gcgaaagaag ctgcttaa ~ 498 

<210> 93 
<211> 2145 
<212> DNA 
<213> E . Coli 

<400> .93 

gtgtcccgta ttattatgct gatccctacc ggaaccagcg tcggtctgac cagcgtcagc 60 

cttggcgtga tccgtgcaat ggaacgcaaa ggcgttcgtc tgagcgtttt caaacctatc 120 

gctcagccgc gtaccggtgg cgatgcgccc gatcagacta cgactatcgt gcgtgcgaac 180 

tcttccacca cgacggccgc tgaaccgctg aaaatgagct acgttgaagg tctgctttcc 240 

agcaatcaga aagatgtgct gatggaagag atcgtcgcaa actaccacgc taacaccaaa 300 

gacgctgaag tcgttctggt tgaaggtctg gtcccgacac gtaagcacca gtttgcccag 360 

tctctgaact acgaaatcgc taaaacgctg aatgcggaaa tcgtcttcgt tatgtctcag 420 

ggcactgaca ccccggaaca gctgaaagag cgtatcgaac tgacccgcaa cagcttcggc 480 

ggtgccaaaa acaccaacat caccggcgtt atcgttaaca aactgaacgc accggttgat 540 

gaacagggtc gtactcgccc ggatctgtcc gagattttcg acgactcttc caaagctaaa 600 

gtaaacaatg ttgatccggc gaagctgcaa gaatccagcc cgctgccggt tctcggcgct 660 

gtgccgtgga gctttgacct gatcgcgact cgtgcgatcg atatggctcg ccacctgaat 720 

gcgaccatca tcaacgaagg cgacatcaat actcgccgcg ttaaatccgt cactttctgc 780 

gcacgcagca ttccgcacat gctggagcac ttccgtgccg gttctctgct ggtgacttcc 840 

gcagaccgtc ctgacgtgct ggtggccgct tgcctggcag ccatgaacgg cgtagaaatc 900 

ggtgccctgc tgctgactgg cggttacgaa atggacgcgc gcatttctaa actgtgcgaa 960 

cgtgctttcg ctaccggcct gccggtattt atggtgaaca ccaacacctg gcagacctct 1020 

ctgagcctgc agagcttcaa cctggaagtt ccggttgacg atcacgaacg tatcgagaaa 1080 

gttcaggaat acgttgctaa ctacatcaac gctgactgga tcgaatctct gactgccact 1140 

tctgagcgca gccgtcgtct .gtctccgcct gcgttccgtt atcagctgac fgaacttgcg 1200 

cgcaaagcgg gcaaacgtat. cgtactgccg gaaggtgacg aaccgcgtac cgttaaagca 1260 

gccgctatct gtgctgaacg „tggtatcgca acttgcgtac tgctgggtaa tccggcagag 1320 

atcaaccgtg ttgcagcgtc tcagggtgta gaactgggtg cagggattga aatcgttgat 1380 

ccagaagtgg ttcgcgaaag ctatgttggt cgtctggtcg aactgcgtaa gaacaaaggc 1440 

atgaccgaaa ccgttgcccg cgaacagctg gaagacaacg tggtgctcgg tacgctgatg 1500 

ctggaacagg atgaagttga tggtctggtt tccggtgctg ttcacactac cgcaaacacc 1560 

atccgtccgc cgctgcagct gatcaaaact gcaccgggca gctccctggt atcttccgtg 1620 

ttcttcatgc tgctgccgga acaggtttac gtttacggtg actgtgcgat caacccggat 1680 

ccgaccgctg aacagctggc agaaatcgcg attcagtccg ctga.ttccgc tgcggccttc 1740 

ggtatcgaac cgcgcgttgc tatgctctcc tactccaccg gtacttctgg tgcaggtagc 1800 

gacgtagaaa aagttcgcga agcaactcgt ctggcgcagg aaaaacgtcc tgacctgatg 1860 

atcgacggtc cgctgcagta cgacgctgcg gtaatggctg acgttgcgaa atccaaagcg 1920 

ccgaactctc cggttgcagg tcgcgctacc gtgttcatct tcccggatct gaacaccggt 1980 

aacaccacct acaaagcggt acagcgttct gccgacctga tctccatcgg gccgatgctg 2040 

cagggtatgc gcaagccggt taacgacctg tcccgtggcg cactggttga cgatatcgtc 2100 

tacaccatcg cgctgactgc gattcagtct gcacagcagc agtaa 2145 

<210> 94 
<211> 1767 
<212> DNA 

<213> E. Coli ' . 



. ' <400> 94 

atgaataatt ctattaacca taaatttcat cacattagcc gggctgaata ccaggaattg 60 

ttagccgttt cccgtggcga cgctgt.tgcc gattatatta ttgataatgt ctctattctc * 120 

gacctgatca atggcggaga aatttccggc ccaattgtga ttaaaggacg ttacattgcc 180 

ggtgttggcg cagaatacac tgatgctccg gctttgcagc ggattgatgc tcgcggcgca 240 

acggcggtgc cagggtttat tgatgctcac ctgcatattg aatccagcat gatgacgccg 300 

gtcacttttg aaaccgctac cctgccgcgc ggcctgacga ccgttatttg cgaccctcat 360 

gaaatcgtca acgtgatggg cgaagccgga ttcgcctggt ttgcccgctg tgccgaacag 420 

gcaaggcaaa accagtactt acaggtcagc tcttgcgtac ccgccctgga aggctgcgat 480 
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gttaacggrg 
accggccttg 
gataaacrgg 
aaagaactta 
gaagaaggac 
gcccgcaatc 
ctctgtaccg 
attcgccgcc 
tggtcgacgg 
gccgatat eg 
ggegagcega 
gctccgccat 
caatttaege 
cact cccact 
gccgnacttg 
ggactgaatg 
ateggtcgea 
gggctgtgcg 
ctgatgagca 
gcccgtgaat 
ccagrgatcc 
ttcactacgc 



PCT/USOO/02200 



ccagttttac 
cagaaatgat 
atgeattteg 
aegectatat 
geeggaaat t 
tcaacgcgct 
atgaccgtaa 
tgatcgaaca 
cgcgccactt 
tcctgttgag 
ttgatgegea 
atggcaacac 
ccggaaaacg 
ccagcgtcta 
agegttaegg 
aaggtgcgct 
gtgecgaaga 
tggtacgtaa 
ccgacacggc 
gcggtccgtt 
ccgcgctaaa 
tggaagtcac 



ccttgaacag 
ggactaccct 
qcacctgacg 
tactgegggt 
acaactcggc 
ggcaccgttg 
cccgt gggag 
acacaatgt g 
tggtctgaat 
egatgegegt 
aaccttacag 
cattgcccgc 
etategggtc 
cagegaaaat 
geaaeggctg 
ggctgcgacg 
gatggcgctg 
eggecaggta 
gcagt cgctg 
acccgatgag 
actaaccagt 
ggaataa 



atgctcgcct 
ggegtaatta 
ctggacggtc 
attgaaaact 
atgtcgttga 
atcaacgaat 
atcgcccatg 
ccgctgca tg 
cacctcggct 
aaggtcaegg 
gcggaagagt 
cagccagttt 
at tgacgt ca 
ggttt tgatc 
gctccggctt 
gtcagccatg 
geggtcaate 
caaagtcatc 
gcggaacaaa 
ccgtttattc 
caggggctat 



ggegggacca 
gcgggcagaa 
actgcccggg 
gecacgaaag 
tgatccgega 
ttaacagccc 
aaggacacat 
tggcatat eg 
tactggcacc 
tgcagcaggt 
eggegagact 
ccgccagcga 
tccataacga 
gcgatgatgt 
gtggtttgct 
acagecataa 
aggtgattca 
tgccgttacc 
ttgaegcett 
agatggcgtt 
ttgatggcga 



teegcaggtt 
tgcgctgctc 
tttgggtggt 
ttatcagct g 
agggtccget 
geaatgeatg 
egatgeett a 
cgtcgccagc 
eggcaagcag 
actggtgaaa 
ggcacaatcc 
ctttgccct g 
attgattacg 
gagctttatt 
tggcggcttt 
tattgtggtg 
ggatggcggc 
cattgeeggg 
gaaagccgcc 
tctttctctg 
gaagt ttgee 



540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1767 



<210> 95 
<211> 1227 
<212> DNA 
<213> E. Coli 

<400> 95 
atggcgtatt geaatceggg 
catgtggtaa caggcatagg 
ttatctgeca gcgaggttgc 
gcacagtacg tttctgttcc 
gcagccaccc agggggctga 
aatgccagtt gggggatctc 
agtgggctgg agctggtacc 
acaggegagt taatcctgea 
ggcggcagcg ctacaaatga 
tgcgacgcca aeggcaatga 
attgatattt ccggcctcga 
gtcaccaatc cgctggtggg 
gecagrgaag cgatgattgt 
aaaaaagege tgcatgttga 
ggcgcggcgc taatggcg'tt 
aeggegctga atctggagga 
cgtattgaca gecagagtat 
aagtaccata aaceggtgat 
catcagcatg geattgatge 
gcattccgcg gggcttatga 
gcgatrggaa tgcgcaacgc 

<210> 96 
<211> 900 
<212> DNA 
<213> E. Coli 



cctggaatcc 
tatgaaaatc 
.geaggegata 
ggttgccgac 
acgtcacgcc 
eggegatgge 
tgcggaaaaa 
ggcgctggag 
tggeggegea 
aattggtttt 
tccgcgctta 
egataaegge 
tgagctggac 
tgtgaaagat 
tcttggtgcg 
acatattcac 
teaegggaag 
tggcattgcg 
ggt ctr cage 
caatatctgc 
ggggtga 



aggecgaata 
gtaatcgccc 
gaaaaaggat 
ggtggcgaag 
tgggttacag 
aaaaccgcgt 
cgegat ccac 
ageggtgega 
ggcatggtac 
ggcggcggta 
aaagattgcg 
gcatcgcgca 
aataacct ct 
gtccccggtg 
gaactgaaaa 
gattgtzaege 
gtaccgattg 
ggtagcctga 
gtattgacca 
cgtgcttcac 



agagaaaege 
cagactctta 
ttcgggaaat 
gaacggtgga 
ggccgctggg 
ttattgaaat 
t cgtgaccac 
caaacattat 
aggcgctggg 
gtcttaatac 
teattcgegt 
tctttggccc 
ct cactatgc 
caggagctgc 
gtggtattga 
tggtgatcac 
gtgtcgcaaa 
ccgatgatgt 
gcataggtac 
gtaatatege 



cctccggcgt 
taaagaaagt 
ttttcctgat 
agegatgatt 
cgagaaagtg 
ggcggcggcc 
ttcacgcggc 
tateggcatt 
ggegaaatta 
tctgaatgat 
cgcttgtgat 
acaaaaggga 
cgaggtcatt 
gggtggtatg 
aategtcact 
cggtgaaggg 
cgtggcgaag 
tggcgttgta 
gttggacgaa 
cgcgacactg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1227 



<400> 96 • 

atgartgata tgactatgaa agttggtttt attggcctgg ggattatggg taaaccaatg 60 

agtaaaaacc ttctgaaagc aggtt actcg • ctggtggttg ctgaccgtaa cccagaagct 120 

attgergacg tgattgctgc aggtgcagaa acagegtcta eggctaaage gategctgaa 180 

cagtgcgacg tcatcataac catgctgcca aactcccctc atgtgaaaga ggtggcgctg 240 
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ggtgagaatg gcattattga aggcgcgaag ccaggtacgg tattgatcga tatgagttct 300 

atcgcaccgc tggcaagccg tgaaatcagc gaagcgctga aagcgaaagg cattgatatg 360 

ctggatgctc cggtgagcgg cggtgaaccg aaagccatcg acggtacgct gtcagtgatg 420 

gtgggcggcg acaaggctat tttcgacaaa tactatgatt tgatgaaagc gatggcgggt 480 

tccgtggtgc ataccgg.gga aatcggtgca ggtaacgtca ccaaactggc aaatcaggtc 540 

attgtggcgc tgaatattgc cgcgatgtca gaagcgttaa cgctggcaac taaagcgggc 600 

gttaacccgg acctggttta tcaggcaatt cgcggtggac tggcgggcag taccgtgctg 660 

gatgccaaag cgccgatggt gatggaccgc aacttcaagc cgggcttccg tattgatctg 720 

catattaagg atctggcgaa tgcgctggat acttctcacg gcgtcggcgc acaactgccg 780 

ctcacagctg cggttatgga gatgatgcag gcactgcgag cagatggttt aggaacggcg 840 

gatcatagcg ccctggcgtg ctactacgaa aaactggcga aagtcgaagt tac.tcgttaa 900 



<210> 97 
<211> 771 
<212> DNA 
<213> £ . Coli 

<400> 97 

atgaataacg at gtttt ccc gaataaattc aaagccgcac tggctgcgaa acaggtacaa 60 

attggttgct ggtcagcact ctctaacccg attagcactg aagttcttgg tttggctggg 120 

tttgactggc tggtgctgga tggcgaacat gcgccaaacg atatctccac gtttattccg 180 

cagttaatgg ccttgaaagg cagcgccagc gcgccagtag tgcgagtgcc gaccaacgag 240 

ccggtaatta ttaagcgtct tctggatatc ggtttctata acttcctgat tccttttgta 300 

gaaacaaaag aggaagcaga gctggcggtg gcatcaaccc gttacccacc ggaaggcatt 360 

cgcggcgtct ccgtttctca ccgcgccaat atgtttggca ccgtggcgga ttatttcgct 420 

cagtcgaaca agaacatcac tattctggtc cagatagaaa gtcagcaggg cgtagataac 480 

gtcgatgcca ttgccgctac cgaaggcgta gacggcatct tcgtcggccc cagcgatctg 540 

gccgcggcat taggccatct cggcaatgca tcacacccgg atgtacaaaa agcaattcag 600 

cacattttta accgtgccag cgcgcacggc aaacccagcg gtatcctcgc gccggtcgaa 660 

gccgatgcgc gtcgttatct ' ggaatggggc gcgacgtttg tggctgtcgg cagcgatctc 720 

ggcgtcttcc gctctgccac tcagaaactg ■ gctgatacct ttaaaaaata a 771 

<210> 98 
<211> 1335 
<212> DNA 
<213> E. Coli 

<400> 98 

atgattctgg acaccgttga cgaaaaaaag aaaggcgtgc atacccgcta tttaatatta 60 

ctgattattt ttattgttac cgccgttaac tacgccgatc gtgcaacgct gtctattgct 120 

ggtaccgaag tggcaaaaga gttgcagtta agtgcggttt cgatgggtta catcttctcc 180 

gcttttggct gggcctactt gctgatgcaa atccccggcg gctggctgct tgataagttt 240 

ggctcgaaaa aagtttacac ctacagcctc tttttctggt cgctattcac cttcctgcaa 300 

ggctttgttg atatgttccc gctggcctgg gcagggatct ccatgttctt tatgcgcttt 360 

atgctcggct tctcggaagc gccatcattc ccggcgaacg cccgaattgt cgccgcctgg 420 

ttcccgacga aagaacgtgg tactgcctcc gccatcttta actcggcgca atatttctcg 480 

ctggcgctct ttt.cgccgct gcttggctgg ctgactttcg cctggggctg ggagcacgtc • 540 

tttaccgtta tgggggtgat tggttttgtg ctgacggcgc tgtggatcaa gttgattcat ■ 600 

aacccgacag atcacccacg tatgtctgcg gaagagctga agtttatctc tgaaaatggc 660 

gcggtggtcg atatggacca caaaaagccg ggcagtgcgg cagcaagcgg acccaaactg 720. 

cattacatca agcaattgct ctctaaccgc atgatgctgg gcgtattttt cggacaatat 780 

tttatcaaca ccatcacctg gttcttcctc acctggttcc cgatttatct ggtgcaggaa 840 

aaaggcatgt cgattctgaa agtgggtctg gtcgcctcga ttccagcact gtgtggtttt 900 

gcgggcggcg tgctgggagg tgtcttctcg gattatctga tcaaacgcgg tttatccctg 960 

accctggcac gtaagctacc gatrgtgctg ggaatgttgc tggcttccac catcatctta 1020 

tgtaactaca ccaacaacac cacgctggtg gtcatgctga tggcgctggc tttctttggc 1080 

aaaggatttg gtgcgctggg ctggccggtg atttctgaca ccgcgccgaa agagattgtt 1140 

ggcctctgcg gcggcgtctt taacgtcttt ggcaatgttg cctccattgt cactccactg 1200 

gtgatt'ggct acctggtaag tgaactgcac tccttcaatg cagcactggt tttcgtggga 1260 

tgttcagcgc tgatggcgat ggtctgctac ctcttcgtag ttggcgacat taaacgtatg . 1320 

gaattgcaga aataa 1335 
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<210> 99 
<211> 1536 
<212> DNA 
<213> E . Coli 

<400> 99 

atgcaaacga gtgatacccg cgcgttaccg ctactttgcg cccgctcggt ttataaacag 
tattcagggg tcaatgtcct gaaaggcatc gattttacgt tgcatcaggg ggaggtccac 
gccctgctcg gcggcaatgg tgccggtaaa tcgacgttaa tgaagattat tgccggtatt 
acccctgctg atagcggtac gctggagatt gagggcaaca actacgtcag attaacgcca 
gttcatgctc atcagctggg tatttatctc gttccccagg aaccgctgct tttcccaagc 
ctgtcgataa aagaaaacat cctgtttggg ctggcaaaaa aacagctctc catgcagaaa 
atgaagaact tgctggcggc gctgggctgc cagtttgatc tgcatagtct ggcaggatcg 
ctggatgrcg ccgatcgcca aatggtggaa atcctccgcg ggctgatgcg cgactcgcgg 
attctgatcc tcgatgaacc taccgcctcg cttacccctg cggaaaccga acgcttgttt 
agtcgcttgc aagagctgct tgctactggc gtgggtattg tttttatctc gcataagctg 
ccggaaattc gccagattgc cgatcgaatt agcgtgatgc gcgacggaac catcgcctta 
agcggcaaaa ccagcgaact gtctaccgac gacattattc aggccatcac cccagcggta 
cgggaaaaat cgctctctgc cagccaaaaa ttatggctgg agttacctgg taaccgccca 
caacatgccg ccggaacgcc ggtgctgaca ctggaaaatc tgaccggcga aggtttcagg 
aatgtcagcc tgacgctcaa tgccggagaa attctgggcc tggctgggct ggtgggggcc 
ggacgcacag aactggccga gacgctctat ggtctgcgta ctttgcgtgg cggacgcatt 
atgctgaatg gtaaagagat caataaatta tccactggag aacgtttact gcgcggtctg 
gtttatctgc cggaagatcg ccagtcatcc ggactgaatc tcgatgcttc gctggcctgg 
aacgtctgcg cgcttactca taaccttcgt ggattctggg cgaaaaccgc gaaagataat 
gccaccctgg aacgttatcg tcgggcgctg aatattaaat tcaaccaacc ggaacaagct 
gcacggacat tatccggtgg caaccagcaa aaaatcctca ttgccaaatg cttggaagct 
tcgccgcaag tattgattgt cgatgagccg acgcgcggcg tggatgtctc ggcccgtaat 
gatatctacc agctgttgcg cagcatcgcc gcacaaaatg tggctgtgct gcttatctcc 
tccgacctgg aagagatcga actgatggca gatcgtgtgt atgtgatgca tcagggcgaa 
attacccact ctgcactgac -cgagcgcgat attaatgtcg agactattat gcgcgttgcc 
ttcggcgata gtcagcgtca ggaggcgtca tgctga 

<210> 100 
<211> 1029 
<212> DNA 
<213> E. Coli 



<400> 100 

atgctgaagt ttattcagaa caaccgtgaa atcacggcac tgctggcggt ggtgctgctg 60 

tttgtattac ccggttttct cgaccgccag tatttaagtg tgcaaacgct gaccatggtt 120 

tatagcagcg cgcaaatcct gatcctgctg gcaatgggcg cgacgctggt aatgcttacg . 180 

cgcaatattg atgtttcagt gggttcgatt accggaatgt gcgcggtgct gttggggatg 240 

ttactgaacg caggatattc actacctgtt gcttgtgtcg cgactttact gcttggtttg 300 

ctcgcgggat ttttcaacgg tgtcctggtc gcgtggctaa agatccctgc cattgttgcc 360 

acccttggca cgttagggtt gtacagaggc atcatgttgc tgtggactgg cggcaaatgg 420 

attgaagggt tacccgccga actgaaacag ctctccgccc cgctgctgct tggcgtttca 480 

gcaattggtt ggttgacgat aattctggtg gcatttatgg cctggctgct ggcaaagacg .540 

gcgtttggac gcagt tttta tgccacgggc gataatttac agggcgctcg tcaactgggc - 600 

gttcgtactg aagccattcg cattgtggca ttttcgttga acggctgcat ggcggcactg 660. 

gcgggaattg tgtttgettc gcagattggt tttatcccca accagaccgg taccgggctg 720 

gagatgaaag caattgcagc ctgcgtgctg ggcggcatta gtttgctcgg tggttccggt 780 

gcgatcattg gtgcggtact cggcgcatgg ttcctgacgc agatcgatag cgtactggtg 840 

ctgttgcgca ttccggcatg gtggaatgat tttatcgcgg gtctggttct gctggcggtg 900 

ctggtgtttg atggacgcct gcgttgtgcg ctggaacgta atctacggcg gcaaaaatat -960 

gcccgcttta tgacgccacc gccatccgtt aaacccgctt cgtcaggtaa aaaacgggag 1020 

gccgcataa " 10.29 



<210> 101 
<211> 993 
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<212> DNA 
<213> E. Coli 
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<400 
atgcgtattc 
gcatttggtg 
gactttatct 
atcgatattt 
caaagtggtg 
gggctgatca 
ggcacgctgt 
gggtacgaag 
ctgggactcc 
ctgcataaaa 
ctttatagcg 
gcgtctgcgg 
ggtgcgtcgt 
ggtggttccg 
caaggtttgc 
cttatcgtcg 
ctggcgcgtc 



> 101 
gctacggttg 
caattaaccc 
gcattggcat 
cgtttggttc 
tgccgatgcc 
acgccggatt 
atctgtttgc 
gtattggtgg 
ccgttccgct 
cccatgccgg 
cgattccagt 
tcgccgctgt 
ttctgatgcc 
gttccattat 
aaatggcagg 
ttgtcgtagg 
gggccaataa 



ggaactggct 
gcgaatgtta 
tgtcgcccta 
gaccatcggc 
gctggcgata 
aattatctat 
cggaagcgct 
attcccgatg 
gattatcttc 
acgtaatgtg 
taaccgtacc 
gctgctggta 
cgccatcacc 
cggcaccgcc 
agtgccaaat 
tcgttccgtt 
cccattgcca 



cttgccgcac 
gatct caata 
ccgctaacga 
ct ctgcgcca 
ct cctgacct 
accaaagtta 
ctgctgcttt 
gcgtttacag 
ctgatatgtc 
tttttgattg 
tt atgtgcgc 
tcgtattttg 
gccgtggtgc 
attgcggttt 
caggtgtcca 
agcctgcatc 
taa 



tgctcgttat 
tgttgctgtt 
tggtgattgt 
ttgcattggg 
tactgctcgg 
acccgctggt 
ccggtatggc 
atttcgctaa 
tcctcgtttt 
ggcaaagccc 
tctatgccat 
gttcagcacg 
ttggcggggc 
tattagtggg 
gcgccctttc 
gccagcaaat 



tgagattgtc 
cagcaccagt 
cagtggcggg 
cgtactgttt 
cgcattgtgc 
gattacgctt 
cggagcgacg 
cctggatgtg 
ctggctctgg 
gcgcgtggcg 
gacggggctg 
ttccgatctc 
caatatttat 
atatttgcaa 
cggtgcgcta 
taaagagtgg 



<210> 102 

<211> 1023 

<212> DNA 

<213> E . Coli 

<400> 102 

atgacacttc atcgctttaa gaaaatcgcc ttacttagcg 
tctatgaatg tgcaggccgc agagcgtatt gcatttattc 
ttttttacca gcggtggcaa cggcgcacaa caagcgggta 
acctacgacg ggccgacaga a'cccagtgtt tctggtcagg 
gtcaatcaag gttataacgc cattatcgtt tctgcggttt 
gcactgaaac gcgccatgca' acgtggtgtg agagtgctga 
ccggagtgcc gctcttacta cattaatcag ggaacgcccg 
gtggatatgg cggcgcgtca ggtgaataaa gacaaagcca 
agccccaccg ttacggacca aaaccagtgg gtgaaagaag 
gagcatccag gctgggaaat tgtcactacg cagtttggct 
ttacaaaccg cagaaggaat attaaaagcg tatagcgatc 
gatgccaacg ccctgcccgc tgccgcacaa gccgcagaaa 
gcgattgtcg gattcagtac gccaaatgtg atgcgcccgt 
aaagaatttg gcctgtggga tgtggttcag caaggcaaaa 
gcattattga aaaaaggatc aatgaaaacg ggcgacaagc 
caggttgaag tctcgccaaa cagcgttcag ggctatgact 
atcgtactgt taccggagcg cgtgatattc aacaaagaga 
tga 

<210> 103 
<211> 876 
<212> DNA 
<213> E. Coli 



ct cttggcat 
ccaaactggt 
aagagctggg 
tacagttgat 
cgcctgatgg 
cctgggactc 
cccagttagg 
aagtcgcgtt 
cgaaagcgaa 
ataacgatgc 
tcgacgccat 
acttgaaaaa 
atgtagagcg 
tttcagtgta 
tggatatcaa 
acgaagcgga 
atatcggcaa 



tgccgcaat c 
tggcgtggga 
cgttgatgtg 
taataacttc 
cttgtgtccg 
tgatactaaa 
aggtatgttg 
tttctactca 
aatcgccaaa 
cactaaatcg 
tatcgccccc 
tgacaaagt a 
cggcacggt g 
tgtcgcggat 
gggcgtaggt 
tggtaatggc 
atacgatttc 



60 
120 
180 
240 
300 
360 
420 
.480 
540 
600 
660 
720 
780 
840 
900 
960 
993 



60 
120 
18 0 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1023 



<400 
atggcagat t 
aatatccctt 
tcgcggatat 
tttcagggac 
catgccgatg 
aataggccgg 
gaagccgtgg 
caggtttata 



> 103 
tagacgatat 
t taccctgaa 
ttaatccgaa 
cgactaccgg 
tattaatgtg 
tggtactgcg 
cgttatcgat 
t cggcagcga 



taaagatggt 
aggttgcggt 
aacgggtaaa 
acttgaacgc 
tacgcgcggc 
ggcgtcaggt 
ggat gacgcc 
atatgaacat 



aaagattttc 
gcgctggatt 
accgtgatgc 
attgatataa 
att ttgcgca 
gcgaactcta 
gtgcgcctga 
cagtcgatca 



gtaccgatca 
ggggaatgca 
tggcttttga 
a tat cgcccc 
gcgtagttcc 
ttctggcgga 
ac.agt tgcgc 
aaaatattat 



accgcaaaaa 
gtcacgctta 
ccatggttat 
gctgtt tgaa 
ccctgcgacc 
attaagtaat 
ggtggcggcg 
tcagctggtt 



. 60 
120 
180 
240 
300 
360 
420 
480 
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gatgccggaa tgaaagtggg aatgccgacc atggccgtga ctggcgtggg caaagatatg 540 

gtgcgcgatc agcgttattt ctcgctcgcg actcgaatcg ccgctgaaat gggggcgcaa 600 

attatcaaaa cctattatgt cgaaaaaggt tttgaacgga ttgttgccgg atgtccggta 660 

cccattgtta ttgctggcgg taaaaaatta ccggagcgcg aggcgctgga aatgtgctgg 720 

caggctatcg atcagggcgc ttctggtgtg gatatggggc gtaatatttt ccagtctgac 780 

catccggtgg cgatgatgaa agccgtacag gcggtggttc accataacga aacggctgat 840 

cgggcatatg aactctatct gagtgaaaaa cagtaa 375 

<210> 104. 
<211> 291 
<212> DNA 
<213> E. Coli 



<400> 104 



atgcacgtca cactggttga aattaacgtt catgaagaca aggttgacga gtttatcgaa 60 

gtttttcgcc agaaccacct gggctctgta caggaagaag gcaatttgcg cttcgatgtc 120 

ttacaggacc cggaagtgaa ttcgcgcttt tatatctacg aagcctataa agatgaagac 180 

gcagtggcgt tccataaaac cacgccccac tacaaaacct gtgtcgcgaa actggaatct 240 

ttaatgaccg ggccgcgtaa aaaacgtctg ttcaatggtt tgatgccgtg a 291 



<210> 105 
<211> 1152 
<212> DNA 
<213> E . Coli 



<400> 105 
atgtttgaac caatggaact 
ggcggcggta atgctgttga 
gcggtaaata ccgatgcaca 
ggtagcggta tcaccaaagg 
gctgatgagg atcgcgatgc 
gctgcgggta tgggtggtgg 
aaagatttgg gtatcctgac 
aagcgtatgg cattcgcgga 
atcactatcc cgaacgacaa 
gcgtttggcg cagcgaacga 
actcgtccgg gtttgatgaa 
ggctacgcaa tgatgggttc 
gaaatggcta tctcttctcc 
ctggttaaca tcacggcggg 
accatccgtg catttgcttc 
gatatgaatg acgagctgcg 
cctgaaatca .ctctggtgac 
cagcatggga tggctccgct 
aatgcgccgc aaactgcgaa 
caagctgatt aa 



taccaatgac 
acacatggtg 
agcgctgcgt 
actgggcgct 
attgcgtgcg 
taccggtaca 
cgttgctgtc 
gcaggggatc 
act gctgaaa 
tgtact gaaa 
cgtggacttt 
tggcgtggcg 
gctgctggaa 
ct tcgacct g 
cgacaacgcg 
cgt aaccgt t 
caataagcag 
gacccaggag 
agagccggat 



gcggtgatta 
cgcgagcgca 
aaaacagcgg 
ggcgctaatc 
gcgctggaag 
ggtgcagcac 
gtcactaagc 
actgaactgt 
gttctgggcc 
ggcgctgtgc 
gcagacgtac 
agcggtgaag 
gatatcgacc 
cgtctggatg 
actgtggtta 
gttgcgacag 
gt tcagcagc 
cagaagccgg 
tatct ggata 



aagtcatcgg 
ttgaaggtgt 
ttggacagac 
cagaagttgg 
gtgcagacat 
cagtcgtcgc 
ctttcaactt 
ccaagcatgt 
gcggtatctc 
aaggtatcgc 
gcaccgtaat 
accgtgcgga 
tgtctggcgc 
agttcgaaac 
tcggtacttc 
gtatcgg'cat 
cagtgatgga 
ttgctaaagt 
tcccagcatt . 



cgtcggcggc 
tgaattcttc 
gattcaaatc 
ccgcaatgcg 
gg.tctttatt 
tgaagtggca 
tgaaggcaag 
ggactctctg 
cctgctggat 
tgaactgatt 
gtctgagatg 
agaagctgct 
gcgcggcgtg 
ggtaggtaac 
tcttgacccg 
ggacaaacgt 
tcgctaccag 
cgtgaatgac 
cctgcgtaag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1152 



<210> 106 
<211> 3048 
<212> DNA 
<213> E. Coli 



<400 
atggacgtca 
gtagcggcat 
ttacgcgcta 
atgtatagcc 
gacccggatc 
tacgtcaaca 
tggcagcgca 
cgtgacgcta 



> 106 
gtcgcagaca 
tgggctttgc 
aagagatccg 
tgggtgatgg 
atccggtaag 
gtgaaaaccg 
t tagctggga 
actttattga 



attttttaaa 
cccgaagcaa 
taacacctgc 
cgcaaaaaac 
ccgtggtgcg 
tctgcgct ac 
agaagcattc 
aaagaacgag 



atctgcgcgg 
gcactggct c 
acatactgtt 
gccagagaag 
ctgtgcccga 
cc'ggaatatc 
tcccgtattg 
cagggcgtaa 



gcggtatggc 
aggcgcgaaa 
ccgtaggttg 
cgatt tat ca 
aaggggccgg 
gtgcgccagg 
cgaagctgat 
cggtaaaccg 



tggaacaaca 
ctacaaat ta 
cgggctattg 
cattgaaggt 
tttgctggat 
ttctgacaaa 
gaaagctgac 
ttggctttct 



60 
120 
180 
240 
300 
360 
420 
480 
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accggtatgc tgtgtgcctc cggtgccagc aacgaaaccg ggatgctgac ccagaaattt 540 

gcccgctccc tcgggatgct ggcggtagac aaccaggcgc gcgtctgaca cggaccaacg 600 

gtagcaagtc ttgctccaac atttggtcgc ggtgcgatga ccaaccactg ggtggatatc 660 

aaaaacgcta acgtcgtgat ggtgatgggc ggtaacgctg ctgaagcgca tcccgtcggt 720 

ttccgctggg cgatggaagc gaaaaacaac aacgacgcaa ccttgatcgt tgtcgatccc ■ 780 

cgttttacgc gtaccgcttc tgtggcggat atttacgcgc ctattcgttc cggtacggac 840 

attacgttcc tgtctggcgt tttgcgctac ctgatcgaaa acaacaaaat caacgccgaa 900 

tacgttaagc attacaccaa cgccagcctg ctggtgcgtg atgattttgc tttcgaagac 960 

ggtctgttca' gcggctacga cgctgaaaaa cgtcaatacg ataaatcgtc ctggaactat 1020 

cagctcgatg aaaacggcta tgcgaaacgc gatgaaacac tgactcatcc gcgctgtgtg 1080 

tggaacctgc tgaaagagca cgtttcccgc tacacgccgg acgtcgttga aaacatctgc 1140 

ggtacgccaa aagccgactt cctgaaagtg tgtgaagtgc tggcctccac cagcgeaccg 1200 

gatcgcacaa ccaccttcct gtacgcgctg g'gctggacgc agcacactgt gggtgcgcag 1260 

aacatccgta ctatggcgat gatccagttg ctgctcggta acatgggtat ggccggtggc 1320 

ggcgtgaacg. cattgcgtgg tcactccaac attcagggct tgactgactt aggcctgctc 1380 

tctaccagcc tgccaggtta tctgacgctg ccgtcagaaa aacaggttga tttgcagtcg 1440 

tatctggaag cgaacacgcc gaaagcgacg ctggctgatc aggtgaacta ctggagcaac 1500 

tatccgaagt tcttcgttag cctgatgaaa tctttctatg gcgatgccgc gcagaaagag 1560 

aacaactggg gctatgactg gctgccgaag tgggaccaga cctacgacgt catcaagtat 1620 

ttcaacatga tggatgaagg caaagtcacc ggttatttct gccagggctt taacccggtt 1680 

gcgtccttcc cggacaaaaa caaagtggtg agctgcctga gcaagctgaa gtacatggtg 1740 

gttatcgatc cgctggtgac tgaaacctct accttctggc agaaccacgg tgagtcgaac 1800 

gatgtcgatc cggcgtctat tcagactgaa gtattccgtc tgccttcgac ctgctttgct 1860 

gaagaagatg gttctatcgc taactccggt cgctggttgc agtggcactg gaaaggtcag 1920 

gacgcgccgg gcgaagcgcg taacgacggt gaaattctgg cgggtatcta ccatcatctg 1980 

cgcgagctgt accagtccga aggtggtaaa ggcgtagaac cgctgatgaa gatgagctgg 2040 

aactacaagc agccgcacga accgcaatct gacgaagtgg ctaaagagaa caacggctac 2100 

gcgctggaag atctctatga cgctaatggc gtgcttattg cgaagaaagg tcagttgctg 2160 

agtagctttg cgcatctgcg tgatgacggt acaaccgcat cttcttgctg gatctacacc 2220 

ggtagctgga cagagcaggg caaccagatg gctaaccgcg ataactccga cccgtccggt 2280 

ctggggaata cgctgggatg ggcctgggcg tggccgctca accgtcgcgt gctgtacaac 23.40 

cgtgcttcgg cggatatcaa cggtaaaccg tgggatccga aacggatgct gatccagtgg 2400 

aacggcagca agtggacggg taacgatatt cctgacttcg gcaatgccgc accgggtacg 2460 

ccaaccgggc cgtttatcat gcagccggaa gggatgggac gcctgtttgc catcaacaaa 2520 

atggcggaag gtccgttccc ggaacactac gagccgattg aaacgccgct gggcactaac 2580 

ccgctgcatc cgaacgtggt gtctaacccg gttgttcgtc tgtatgaaca agacgcgctg 2640 

cggatgggta aaaaagagca gttcccgtat gtgggtacga cctatcgtct gaccgagcac 2700 

ttccacacct ggaccaagca cgcattgctc aacgcaattg ctcagccgga acagtttgtg 2760 

gaaatcagcg aaacgctggc ggcggcgaaa ggcattaata atggcgatcg tgtcactgtc 2820 

tccagcaagc gtggctttat ccgcgcggtg gctgtggtaa cgcgtcgtct gaaaccgctg 2880 

aatgtaaatg gtcagcaggt tgaaacggtg" ggtattccaa tccactgggg ctttgagggt 2940 

gtcgcgcgta aaggttatat cgctaacact ctgacgccga atgtcggtga tgcaaactcg 3000 

caaacgccgg aatataaagc gttcttagtc aacatcgaga aggcgtaa 3048 

<210> 107 
.<211> 885 
<212> DNA 
<213> £ . Coli 

<400> 107 

atggctatgg aaacgcagga cattatcaaa aggtccgcaa ctaactccat cacgccgcct 60 

tctcaggtgc gtgattacaa agcagaagtc gcaaaactta tcgacgtttc cacctgtatc 120 

ggctgtaaag cctgtcaggt ggcgtgttcg gagtggaacg acatccgtga tgaagtgggg 180 

cactgcgtcg gggtttacga taaccccgcc gatctgagcg ccaagtcctg gacggtgatg 240 

cgctttagcg" aaaccgaaca gaacggcaag ctg'gagtggc tgatccgtaa agacggctgt 300 

atgcactgtg aagatcccgg ctgcctgaag gcgtgcccgt ctgctggtgc aatcattcag 360 

tacgctaacg ggattgtcga tttccagtcg gaaaactgca tcggctgtgg ttactgcatt. 420 

gccgggtgtc cgtttaatat tccgcgcctc aacaaagagg ataaccgggt atataaatgc 480 

acgctc-tgcg tcgatcgcgt cagcgtcggc caggaaccgg. cttgtgtgaa aacctgtccg 540 

accggggcta tccacttcgg caccaagaag gagatgctgg agctggcgga acagcgcgtg 600 

gcgaaactga aagcgcgtgg ttacgaacat gctggcgtct acaacccgga aggggtcggt 660 
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ggtacgcaog tta^Tacgt gctgcatcac gccgatcagc cggagctgta tcacggtctg 

ccgaaagatc cgaagatcga cacctcggta agcctgtgga aaggcgcgtt gaaaccgctg 

gcagcggctg gctttattgc cacttttgcc gggttgattt tccactacat cggtattggc 

ccgaataagg aagtggacga tgacgaggag gatcatcatg agtaa 

<210> 108 
■<211> 654 • 
<212> DNA 
<213> E. Coli 



720 
780 
840 
885 



<400 
atgagtaagt 
gtggtgattt 
caatggctga 
ggcattgcga 
ccggataaga 
cataaagtgg 
a tgagcatga 
cagtacttcc 
atcctgatcc 
aaagggatga 
t atcgtgaaa 



> 108 
cgaaaatga t 
gcttcttcct 
cgcaaacctt 
t ttt cgt cgc 
aagatattcc 
cggatgt egg 
ttttcgtgct ' 
egatgeaggt 
acgccatcct 
tcgaagggaa 
tcgagaaggc 



tgtgcgcacc 
ggtggcgctg 
cggtacgccg 
actgatgttt 
gtggctgttg 
taagtacaac 
gctggtgacc 
tgttcgctac 
gatccatatg 
ggtaagtcgt 
agaagegaaa 



aaatttattg 
teegggattt 
cagatgggac 
atgtttgtgc 
aacattgtcg 
geegggcaaa 
ggggtgatta 
agectgetga 
tatatggcat 
cgctgggcga 
aaagagagtg 



atcgcgcctg 
cgttcttctt 
geattttgea 
gttttgtgca 
aagtattgaa 
agatgatgtt 
t ctggcgt cc 
tccacgcggc 
tttgggtgaa 
agaaacacca 
aagaaggga t 



tcactggacc 
cccgacgctg 
cccgttcttc 
t cacaacatc 
aggcaatgag 
ctggtcgatc 
gtactttgcg 
tgegggtate 
aggatcgatt 
tccgcgctgg 
ataa 



<210> 109 

<211> 261 

<212> DNA . 

<213> E . Coli 

<400> 109 

atggcgttgt taatcactaa aaaatgeate aattgtgata tgtgtgaacc cgaatgcccg 

aatgaggega tttcaatggg agatcatatc tacgagatta a'cagcgataa gtgtaccgaa 

tgegtaggge actacgagac. accaacctgc cagaaggtgt gcccgatccc caatactatt 

gtgaaagatc eggegcatgt cgagacagaa gaacagttgt gggataaatt tgtgctgatg 
caccacgcgg a.taaaattta a 

<210> 110 
<211> 1203 
<212> DNA 
<213> E . Coli 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
654 



60 
120 
180 
240 
261 



<400 
atgcaaagtg 
ggcttacagg 
gcggcgaatg 
ctcacccgtc 
atggaagtgt 
tatagecate 
gcgcatcagt 
ggagaaaatg 
attggegegg 
tgggattatc 
gtggcgcggc 
catctttget 
agtgaagacg 
aaggttgaga 
tcgcaccgtc 
ggggtaaatc 
cgt cagggga 
cacagtgegg 
aatceggega 
gttaagcege 



> 110 
ttgatgtagc 
ggageggett 
caccaccaca 
ttggcgtctg 
gggacaaaga 
ttgggcatat 
cgtcagatat 
aaaccttcct 
aeggegctaa 
agcat caege 
aggttttcca 
cgattgtctg 
aatttaatcg 
gcgcgcgt ca 
tggcgctggt 
teggctttat 
aagacat egg 
cgttgatgct 
aaaaactget 
aacttatccg 



cattgttggc 
acgcgttgcc 
actgegegtt 
gcaggacatt 
cagctt t ggt 
cgttgaaaat 
cact ctgtta 
gaegctgaaa 
t tcct ggttg 
gctggtagcg 
tggegaagge 
gtcactgtcg 
cgcgttaaat 
ggt gttccca 
gggcgacgcc 
ggatget gca 
gcagtacatt 
ggctggtatg 
gcgtgatatt 
ccaggcaatg 



ggcggcatgg 
gtactggagc 
t eggctatea 
ctctctcgta 
cacatttcgt 
tcagtgattc 
gcccccgcag 
gatggcagca 
cgcaacaaag 
accattcgca 
attctggect 
ccagaggaag 
ategcttttg 
ctgaegggge 
gcacatacca 
gagctgattg 
tatctgegtc 
cagggattcc 
ggtttgaaac 
ggattaaacg 



tggggctggc 
agegegtaca 
atgccgccag 
gggccagctg 
ttgacgatca 
actacgcgct 
aattacagca 
tgttaacggc 
ccgatattcc 
eggaagaace 
ttttaccget 
egcageggat 
ataat cgect 
gttatgcgcg 
tt cacccgct 
ccgaactgaa 
getatgageg 
gcgatctgtt 
tggccgacac 
atttgectga 



ggttgcctgt 
ggaacctctg 
cgaaaaatta 
t tatcaeggt 
aagcat gggc 
gtggaacaaa 
ggtcgcctgg 
gegtet ggtg 
gctgactttc 
geatgatgeg 
tagegatccg 
gcagcaggca 
gggctt atgc 
ccagtt tgee 
ggcggggcag 
aeggttgeat 
tagcegcaag 
tt ccggtacc 
gcttcctggc 
atggct gcgt 



60 
120 
180. 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
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<210> 111 

<211> 1179 

<212> DNA 

<213> E . Coli 



<400: 
atgagcgtaa 
cggttaagtc 
gctcatccgg 
gcgcgcatcg 
gtcagcgatc 
gcgctgggac 
aaagcacctg 
agtcacgttg 
gctgatggca 
tacgaacaac 
tttgaacgct 
t cgctggtct 
aagttttgcc 
ggtaaacgca 
accgtgctgg 
ctcggtatgc 
gaagacatgg 
gaagcaacca 
ctggttgtcg 
gtgctggcgc 



111 
tcatcgtcgg 
acggggcgct 
gctttgatgg 
gcgtctggca 
gtggt cacgc 
aggttgtcga 
gcgtaacgct 
aagtgacgct 
cccattcagc 
t ggccgtgat 
ttacgcaaca 
ggtgtcatcc 
gtgaactcca 
gtgct tatcc 
tgggcaatgc 
gagatgtgat 
gggat tacgg 
ttggcgtcac 
ggcgcaacat 
agcgcaccct 



tggcggcatg 
gccggtacat 
acgagcgata 
atct ctggcg 
t ggatt tgtc 
att gcacaat 
gcattgccct 
ggagagtggc 
gttagccacc 
tgccaacgtt 
tggcccgctg 
actggaacgg 
gtcggccttt 
gctggcgtta 
ggcgcaaact 
gagtctt gcg 
cgtar tgtgc 
ggacagcctt 
cgggctgatg 
cggttgggtg 



gcgggcgcga 
ttgattgaag 
gcgctggcgg 
gattgcgcaa 
accctcgccg 
gtcgggcaac 
gatcgcgtgg 
gagacgctga 
gcgtgcggcg 
gctacttccg 
gcgatgttgc 
cgcgaagagg 
ggctggcgac 
acccacgccg 
ct gcacccga 
gaaaccct ga 
cgttatcagc 
gtacatcttt 
acgatggaat 
gcgcgttga 



cgctggcgct 
cgactgcgcc 
cgggtacctg 
ct gccat cac 
cagaagatt a 
ggctgtttgc 
ctaacgttgc 
cgggccgcgt 
tt gactggca 
tt gcgcatga 
cgatgt ctga 
tgt tgtcgtg 
ttgggaaaat 
ccagatctat 
t t gccgggca 
ct caggcgca 
agcgtcgaca 
ttgccaaccg 
tatt cacccc 



ggctatttcc 
agagt cacat 
tcagcaactg 
caccgtgcat 
ccaactggcg 
at-tgctgcgt 
ccgtactcag 
get ggtagca 
geaggagect 
agggegeget 
eggaegctgt 
gagtgacgag 
tacccacgct 
tacccatcgt 
agggt ttaac 
ggagcgcgga 
gagegatege 
ttgggcaccg 
ggcacgegat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1179 



<210> 112 
<211> 1326 
<212> DNA 
<213> E. Coif 



<400 
a tgagtgaga 
caacccggca 
gaatacccct 
gcggtgctgg 
gttcgcgacc 
gagaaactgg 
ctacttaacg 
ategtgaaca 
gcaacgatga 
gagattgecg 
gaaaaatgee 
aaccgccacg 
tgcattctgc 
gaegegggtt 
ggcaaattca 
agcctgcgcc 
atcatggtta 
get cagaacg 
gtccatgacg 
ctgacegtag 
ggtateggea 
accgccagcg 
caatga 



> 112 
tatcccggca 
gcgccgcgct 
ategtcagaa 
tget gattaa 
tgacggcgga 
gcgt tgaccg 
gcctggatgt 
gtgcgctgga 
tcgactggcg 
tactccgccg 
gtccgggaat 
gtgegegcta 
actacaccga 
gtgaata.caa 
cccaggccca 
tgtategtec 
gcggcctggt 
cccatcgtcc 
tgggtgttta 
agecaggget 
ttcgtattga 
tggtgaaaaa 



agagtttcag 
gatttttget 
cagtgacttc 
aagegatgae 
gatctggttt 
cgcact ggca 
ggtttaccat 
aaaactgcgt 
tcctgttgtt 
cgcgggagaa 
gttcgagtac 
tccgtcctat 
aaacgagtgt 
aggttacget 
gcgtgaaatc 
gggaacttcc 
aaaactegge 
tttctttatg 
t ggt caggat 
gtatattgcg 
agacgacatt 
geeggaagaa 



cgtcgccgt c 
gcaccagaag 
tggtacttca 
actcataacc 
ggccgtcgct 
ttcagcgaaa 
gcccagggcg 
aaaggttcgc 
catgaaatgc 
atcaccgcca 
catctggaag 
aacaccattg 
gaaatgcgcg 
ggegatatta 
tacgacatt g 
attctggaag 
atcctgaaag 
catggectta 
cgctcgcgca 
ccggatgcag 
gtgattaccg 
ategaagegt 



aggccctggt 
taacaegtag 
ccggctttaa 
acagegttet 
taggecagga 
tcaatcagca 
aatatgeata 
ggcaaaatct 
gcctgttcaa 
tggcaca'tac 
gcgaaatt ca 
teggcagegg 
acggcgacct 
cccgcacctt 
tgctggagt c 
tcactggtga 
gtgatgt tga 
gccactggt t 
tt ctggaacc 
aagtgccaga 
aaaccggtaa 
tgatggttzgc 



ggagcaaatg 
cgccgacagc 
egaaceggaa 
gtttaacege 
tgccgcgcca 
actttatcaa 
tgctgatgta 
caccgcaccg 
ategecagaa 
aegggegatg 
ccacgaat tt 
tgaaaaegge 
ggtgttgatt 
cccggt caac 
tctcgaaacc 
agt ggtgcgc 
tgaactgatc 
aggactggat 
gggcatggta 
acaatatege 
cgaaaacctc 
tgcgagaaag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
.900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1326 



<210> 113 
<211> 585 
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<400 
atgcttatgt 
caacaaggga 
ggcggtaacg 
gctttcggtc 
ctgcaggatg 
ttcgatcggg 
acgcaaccga 
aacattgcgc 
gaagagatca 
ccgcaaccga 



> 113 
ctatacagaa 
cgggt ctgac 
atgacagct c 
atgagctggc 
acggcttcct 
ctgatgcatt 
agctggataa 
aactgggtt a 
tcgaatacgt 
ccgcgccaga 



cgaaatgcct 
cccagctgag 
atggctaccg 
acaggcactg 
ttttcagctt 
ggcaggttgg 
agtgaccggc 
cgacgaagac 
tcgtgttgcc 
agtacaaaaa 



ggttacaacg 
atgcatggtt 
ct act tcacg 
cgtaaaatgc 
tatctgcctg 
gtcaatcact 
gaaaccggtg 
gaagatcagg 
gcgctgttat 
ccgactctac 



aaatgaacca 
taatcagcgg 
acctgacgaa 
actctgccac 
atggcgatga 
tcctgct tgg 
aagctat cga 
aagagcttga 
gccacgacac 
actaa 



gtatctgaac 
gatgat atgt 
cgaaggcat g 
cagcgatgcc 
tgtcagcgtt 
tcttggcgtt 
cgatctgcgt 
aatgtcgctt 
ctttactcat 



.<210> 114 ' 
<211> 363 
<212> DNA 
<213> E. Coli 

<400> 114 
atgttaaagc tatttgcaaa 
gtggtttttg gtgtttgtat 
ggtttcgttg- tggctgtgag 
tcgactacaa cgatgcgcta 
gttggatggg ctgctgatag 
gccatcagcc tggtgtgcgg 
tga 

<210> 115 
<211> 921 
<212> DNA - 
<213> E. Coli-'' 



gtacacctct 
ctatgt cgcg 
ctttagcttc 
catgctatat 
atgcgcactt 
tttcgtctat 



a ttggtgtgc 

catacaaacc 

t tcgcgaatg - 

gttgggttca 

cccccgatga 

tcaaagttca 



tgaacaccct 
aagctcttgc 
caaaat tcac 
tggggacact 
taactcttgt 
ttgtctttag 



tatacactgg 
aaacttcgca 
attcaaggca 
gagtgctact 
caccttctcc 
ggatgcgaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
585 



60 
120 
180 
240 
300 
360 
363 



<400 
atgaagatat 
aaaacggtac 
gacggcagca 
gttgttccgc 
gaccatgcaa 
gttattcctc 
tctgaccgct 
cacaataaaa 
cgtgatgttg 
ctgagctggg 
ggagatacaa 
ttttccacat 
tttatttatg 
tatccttcac 
gtattaggtg 
atcaagagag 



> 115 
ctcttgtagt 
gtgaattcga 
aagacgctac 
tgtcatttac 
ccggggatgc 
atcttattga 
caactgatgg 
taagcaatcc 
tcgaaaatat 
taggaggaaa 
aatttaatgg 
tccct cttcg 
gggcgtggat 
tacttgtttc 
aatatattgg 
tcaaaaaatg 



tcctgt cttc 
agaattgaag 
ggagtcaatc 
acgcaacttt 
gataat ccca 
aaaatggcaa 
acgcct gaag 
taaaattgaa 
taaacttatg 
gacagatatt 
atggaaactt 
catctggaca 
gattttagat 
aat actgttt 
acgcacatac 
a 



aa t gaagaag 
tcatatgaag 
at taatgctc 
ggt aaagaac 
at tgatgttg 
gcaggtgctg 
cgaaaaacgg 
gagaatgttg 
ccagaacgaa 
gttgaatacg 
tggaatttag 
tacatagggt 
actatcatat 
ttaggtggaa 
attgaaacca 



aagcgatacc 
tggaaatcgt 
tggctgtttc 
cagcattgtt 
acctgcaaga 
atatggttct 
ctgagtggtt 
gtgattt cag 
accttttcat 
tgcgagcgga 
cacttgaggg 
tagtggtagc 
tt ggaaatgc 
ttcagatgat 
aaaaacgccc 



aattttttat 
tttcataaat 
agat cct eta 
tgcagggtta 
cccgatt gag 
tgctaaaaga 
ctataagctc 
gctgatgagc 
gaaaggtatt 
aagaatt get 
t attacaagc 
cagtgtagca 
tgttagggga 
tggaatagga 
gaaat acatc 



60 
120 
180 
240 
300 
360 
4 20 
480 
540 
600 
660 
720 
780 
840 
900 
921 



<210> 116 
<211> 1332 
<212> DNA 

<213> E . Coli --...* 
<400> 116 

atgaataaag caataaaagt atcattgtat atatcttttg ttttgattat ttgegectta 60 
tctaaaaaca taatgatgtt aaatacatct gattteggaa gagecattaa gecattaatt - 120 
gaagacatac cagcatttac atatgactta cctttattgt ataaattgaa aggtcatatt 180 
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gattcaattg 
ctgtttatta 
gtaatatata 
gtaactttat 
atgtttgcat 
tcattaacat 
tctactgcta 
tttgatagac 
at atttgcaa 
ttcggtagtt 
aagtgtgttg 
acagaagttg 
tttttattgg 
tct cagtata 
tcaaacatac 
atatcattgt 
tctttatt-tg 
ggatatagag 
tacataacag 
gtaaagcact 



atagctatga 
gcatttttac 
tttattcatt 
ttacattctt 
cattctatca 
gcaaaaacaa 
aaaat caatt 
acaaact aat 
tatcttattc 
atctttatat 
ggttagatgc 
gaacggaatg 
ttagcaaacc 
aagaaaatt a 
taacgactat 
tattattttt 
tagtatctct 
atttaagcaa 
tcgttttttt 
aa 



gtatataagt 
tgaatatct t 
atatgcgata 
tattttagct 
agagcaaata 
taaatctatg 
tatat taacc 
tattaaatct 
aaaaggtgtt 
gaaaaacaac 
ctggggtaat 
tttcgaatct 
aagcaccat c 
tttccatgta 
tactaacat a 
tattgcttct 
ttttggaat a 
gcatttattt 
aattt ataaa 



tcatatagtt 
gatgctaggg 
tttacttcat 
tttcttatgt 
gttataattt 
cttttgctat 
ccactaatag 
gtaatatgcg 
gttgaattaa 
gggtataaaa 
aaat tcgaca 
cataaaga t g 
ttcaaacttc 
tataaaaaac 
aaagacaata 
atttttatta 
tctcaatttt 
ggaatgtat t 
ataattcaaa 



atattttgta 
tgttatcgtt 
atataaaaac 
g.ttcttcatc. 
tccttccatt 
ttttttcgtt 
tgtattcata 
tggtgtgctt 
ataagtacca 
tgccatcgta 
t atcatt tgg 
aaacgttttc 
catttgatga 
tacacgtaat 
tatttaaaaa 
gaaataataa 
atgtgtcatt 
tttcgttcga 
gaaatcaaga 



tacat acgtc 
atttctaaaa 
agaaaggtat 
aacactgtca 
tttggtgtat 
gctaa taata 
ttatattttt 
gcrtgcgtca 
tgcaacatac 
tgttgatgat 
cgcaacccca 
gaat gcactc 
tggtgtgatg 
atatggagaa 
cattagattt 
aataaaggca 
tttcggggaa 
ccttt gctta 
caat agcgat 



<210> 117 
<211> 249 
<212> DNA 
<213> E. Coli 

<400> 117 • 

atgggcattc tgtcatggat tatttttggg cttattgccg gtattctggc gaagtggatc 

atgccaggta aagatggagg tggattcttt atgactatcc tgctggggat agtcggtgcc 

gtagtcggcg gatggatcag cacgctgttt ggctttggta aagtcgatgg cttcaatttt 

ggcagcttcg tggttgccgt tattggtgcg attgtcgtgc tatttatcta caggaagatt 
aaaagttaa. 

<210> 118 
<211> 183 
<212> DNA 
<213> E. Coli 

<400> 118 

atgggcaaag caacgtatac cgtgaccgtc accaataaca gcaatggcgt ttctgtcgat 

tatgaaacag agacgccgat gactttgctg gtgccagaag tggcggctga agtgataaaa 

gatctggtga ataccgtacg ttcttatgac acggaaaacg aacatgatgt ttgtggttgg 
taa " 

<210> 119 
<211> 360 
<212> DNA 
<213> E. Coli 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1332 



60 
120 
180 
240 
249 



60 
120 
180 
183 



<400> 119. 
atgcttcaaa tcccacagaa 
actgcacctg ccggaatatt 
ccacgccttt ccgttatgca 
gcagagcctg atcaggtgat 
aagtggcaca acattgaagc 
gctcctgaag tcctgatgga 

<210> 120 

<211> 741 

<212> DNA 

<213> E . Coli 



ttatattcat 
cgaacgtcat 
tggggcggtc 
ccttatcgaa 
catgactgac 
aggtgcgcaa 



acgcgctcaa 
ct tgat aaag 
aaatat ct eg 
gcggggcagt 
gatacttatt 
caaeggaaag 



cgcctttctg 
gaacgcgccc 
getaegctga 
ttgcggtgtt 
tcaacattga 
tcattcataa 



gaataaacaa 
gggggtttac 
tgaacacagt 
ccct ccagaa 
cttctt cgtg 
egggaaatga 



60 
120 
180 
240 
300 
360 
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<400 
gtgaagttca 
gctaacgaat 
gtgccagaca 
gccaagaaac 
atcgcgaaaa 
gatggtaaaa 
cagttagaca 
cgtt ct gt gt 
gcgattgata 
gggccggtat 
atgatgaaag 
cagtttgatg 
aaaacacctg 



> 120 
aagttatcgc 
tgccggatgg 
ttgccactct 
aggcagatga 
aagatatcag 
gtatccttaa 
aactgaattc 
cgctgggcgt 
acgcgattca 
atagcgtgcg 
ccgatgccgc 
atcaggtcga 
cagcacaata 



cctggcggca 
accgcatatt 
tgcgattgaa 
gcgcgtcgca 
ctcagcgaac 
aggctaccgc 
cttgctggat 
ggcgcagccg 
tcaggcgcag 
ctaccatgtt 
gccggtgtcc 
tgtggtcttc 
a 



ttaatgggta 
gtcacctccg 
gttaacgtgg 
caatacattt 
t tacgcaccc 
gctgtgagaa 
ggcgcgctga 
gatgcctata 
gaactggcga 
tccaactatc 
gcccaggaaa 
cagttagaac 



ttagcgggat 
gtacggcaag 
ccgcgaagga 
ccttccttga 
agccagatta 
cggtggaagt 
aggcgggtct 
aagacaaagc 
acgqctttca 
agcccagccc 
cttacgagca 
ctgtggatca 



ggcagcgcag 
cgtggatgcg 
tgccgctact 
actcaatcag 
tgattatcag 
cacgctccgt 
taacgaaatt 
gcgtaaggca 
tcgtaaactg 
aatggtgcgg 
ggccgctatt 
acaacccgct 



<210> 121 
<211> 1395 
<212> DNA 
<213> E. Coli 



<400 
gtgttattac 
tttcataatt 
acgt t t cagg 
gtaatcgtgg 
tcccggagtg 
cgtggt atgc 
gttccgcgtt 
agccgcgtgg 
ggcgatttac 
ggccaactgg 
caacaactgg 
gtcagtgaaa 
caact cgact 
gatgttggta 
catcctatcg 
cagaaagccg 
agtgaaggca 
gtgaaagcac 
cgcatgttag 
ggcaatgaag 
gtgacgccgg 
gatcgcgt gg' 
gaagcccaga 
ggagcacgct 



> 121 
tggatgcgtg 
cctccatttt 
aagagaaact 
tggttatcgc 
cagccccagg 
gttccggccc 
acctcaccgg 
acggccaact 
tggcagaaat 
caaaagataa 
caaaaaccaa 
ccgaaggcac 
ggagccggat 
accaaatctc 
atttagtctt 
gaaaacegct 
cgctgttaag 
gctttaataa 
tcgacaccga 
gccattttgt 
gcattcagga 
tgacagacgg 
gcgccactac 
cctga 



ct cgcaaatg 
tctcccttat 
cttaacgat g 
cgccatcgcc 
ggcgacgaaa 
attagccccg 
gcttggcacc 
gatagcgtt a 
tgaccccagc 
agccacgctt 
tct cgtttcc 
cattaaggct 
t accgcacca 
cagtggtgat 
taccctgccg 
ggt ggtagaa 
tct agataac 
tcaggatgat 
acaaaacgcc 
ctgggtgctg 
cagt cagaaa 
catt gatcgc 
tccggaagag 



tgcccgt'cat 

tggctggcta 

aaaggtagtt 

gcattctggt 

caagcgcagc 

gtt caggcgg 

attaccgccg 

cat tt ccagg 

cagttcaaag . 

gccaacgccc 

cgccaggagc 

gatgaagcaa 

gtcgatggtc 

accaccggga 

gaaagcgata 

gcctgggatc 

caaatcgatg 

gcgctgtttc 

gtagtgatcc 

aatagcgaaa 

gtggtgatcc 

ctgaccgaag- 

aaagccacca 



t cagacgatt 
cactagtatc 
ataaat cccg 
t ctggcaagg 
aatcgccagc 
cgaccgccgt 
ctaataccgt 
aaggccagca 
ttgcatt age 
gccgtgacct 
tggatgecca 
gcgttgccag 
gcgttggtct 
tcgtggt gat 
t cgctaccgt 
gcaccaactc 
ccactaccgg 
ccaatcagtt 
caacagccgc 
acaaggtcag 
gtgeaggtat 
gggcgaaagt 
gecgegaata 



ccagacagt g 

attccgegaa 

ttgggtaatc 

ccgcaatgac 

gggtggtcga 

agaacaggca 

tacggtgcgc 

ggtcaaagca 

acaagcccag 

ggcgcgttat ' 

acaggegctg 

cgcgcagctg 

caagcaggtt 

cacccagacg 

agtgcaggcg 

gaagaaatta 

tacgattaaa 

tgttaacgcg 

ectgeaaatg 

caaacat ctg 

ttctgeggge 

ggaagtggtg 

cgcgaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
741 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1395 



<210> 122 

<211> 3123 

<212> DNA 

<213> E. Coli 



<400 
atgcaggtgt 
gtggccacca 
ctgcccgttt 
ccaggtgcca 
gggcagatgt 
actttgeagt 
attaaegctg 
gtgaaccegg 



> 122 
t acccccgag 
egctgetgat 
cggcgctgcc 
geceggatgt 
ctggcctgaa 
tccagctaac 
cgaccaactt 
cagatccgcc 



cagcacaggc 
ggtggcgatc 
ggaagtggac 
catgacctct 
acagatgtcg 
at taccgctc 
gt tgecgage 
gatcatgacg 



ggcccgtcgc 
ttact cgccg 
tatccgacca 
gccgttaccg 
tegcaaagtt 
gatgt cgccg 
gatctgecta 
ctcgccgtca 



gcctgtttat 
ggattategg 
ttcaggtggt 
cgccgctaga 
ccggcggtgc 
agcaggaagt 
acccgccggt 
cctcaaccgc 



t atgcgt cct 
ttat cgcgcc 
cacgctctac 
acgccagttc 
gtcagttatc 
gcaggccgcg 
tt acagcaaa 
catgecgatg 



60 
120 
180 
240 
300 
360 
420 
480 
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acgcaagtgg 
gt cggcctgg 
caggcgattg 
aacgttaact 
aacgaccaga 
gcgccaattc 
•ggcgcgtggg 
aacattatct 

. ccgaaatcgg 
gatgatactc 
tttttgcgca 
ggcactttcg 
t taactatcg 
cgctatatcg 
ggctttacca 
atgggcgata 
ttgatctcag 
caggagtcgt 
ataatcgccg 
agcgtggcac' 
ttcttcccgg 
tcctttgcca 
gcagtgcaaa 
gcacgtttac 
at cgcccgtc 
acgcaggatc 
gccacgtcac 
ctgccacagc 
aatgttgatc 
ctgtacaacg 

"cgcgtggtgc 
cgcctgacca 
cgt tt tgcgc 
aacgtgccgg 
acgctgaatc 
tcggcgctgg 
ctcggcattc 
gcaggggt tg 
at: tat cggca 
ttcgcgctgg 
tgtctgttgc 
ccgctgatgt 
gtcggcggtc 
ctgttcgacc 

taa 



aagatatggt 

tgacgctttc 

ccgccctcgg 

cggcaaaagg 

tgcaatccgc 

gtctgggcga 

cgaacaaaga 

ccaccgccga 

tgaaggtgac 

agtt tgaatt 

atatt ccggc 

cggttatggt 

ccaccggatt 

aaaaaggcga 

ttatct cgct 

tcgtcgggcg 

cggtggtgtc 

tgcgt aaaca 

cctatggtcg 

tcagcacgct 

tacaggacaa 

at atggccca 

gcctgacctc 

aaatcaacct 

tgcaaacggc 

tgactattga 

tggatgcgct 

tttctgatgt 

gcgacagcgc 

cgtttggtca 

tggagcacaa 

gcagcga.cgg 

cgctctccat 

ataactattc 

tgccggt gga 

gcagcactgt 

tgtacgagag 

gcgcactgct 

ttattttgct 

ctgct gagcg 

gttttcgtcc 

tgagtaccgg 

tgattgt cag 

gcctggcatt 



ggaaacccgc 

cggcggt cag 

cctgaccagc 

tagcctcgac 

cgaagagtat 

t gt cgcaact 

acaggccatt 

cagcattcgg 

.agtgctttcc 

gatgatggct 

gaccatcatt 

gtttctcgat 

cgtggtcgat 

aaaaccgttg 

gacct tctca 

actgttccgc 

gctgaccctg 

gaaGcgcttc 

tggactggcg 

gctgct tagc 

tggcattatt 

gcgaqaacgc 

atttgttggc 

caaaccgttg 

ggtagataaa 

tactcaggtc 

cagtacctgg 

ctccagcgac 

cagccgtctg 

gcggctgatt 

caccgaaaat 

cggcgtggtg 

caaccat ct g 

gctgggcgat 

tatcaccacg 

ctggctgatt 

ctttattcac 

ggcgttgctg 

gatcggtat c 

cgagcaaggc 

gatcctgatg 

ggtcggcgcg 

ccaggt gctg 

gtggaccaaa 



gtcgcgcaga 

cgtccggctg 

gaaaccgtgc 

ggcccttccc 

cgccagctaa 

gtagagcaag 

gtgatgaatg 

cagatgctgc 

gatcgcacca 

atcgcgctgg 

cccggtgttg 

tttt caat ca 

gacgccatcg 

gcggcggcgc 

ctgattgcgg 

gaatttgct a 

acaccgatga 

tcccgtgcct 

aaagtgctga 

gtgctgctgt 

cagggcactt 

caggtcgcgg 

gttgatggca 

gatgaacgtg 

gtgccgggcg 

agccgcaccc 

gtgccacagt 

tggcaggaca 

gggatcagca 

tccactattt 

accccaggcc 

ccgctaagct 

gatcagttcc 

gcggtgcagg 

cagttccagg 

gtcgcggcgg 

ccgatcacca 

attgctggta 

gtgaagaaga 

a tgt cgccgc 

accact ctgg 

gaactgcgtc 

acgctgttta 

agccgctttg 



aaatctcgca 

ttcgcgtcaa 

gcaccgccat 

gtgcggtcac 

t catcgccta 

gtgcagaaaa 

t tcagcgcca 

cacagctcac 

ccaatat ccg 

tagtcatgat 

ctgtaccgct 

ataacctgac 

tggtgatcga 

tcaagggcgc 

tgttgat ccc 

ttaccct ggc 

t gtgcgcgcg 

cggaaaaaat 

a teat ccgtg 

gggtgttcat 

t gcaggcacc 

acgtgatttt 

ctaacccgtc. 

atgatcgggt 

tcgatctctt 

agtaccagtt 

tga t ggaaaa 

aagggctggt 

tggcggatgt 

at actcaggc 

t.cgcggcgct 

caattgecaa 

eggtaacgae 

cgattatgga 

gcagcaccct 

tggtggcgat 

ttctctcgac 

gcgaactgga 

acgccatcat 

gcgaggcaat 

cggctctgct 

gtccgttagg 

ccacgccggt . 

cccgt catga 



PCT/US00/02200 

gatttcegge 540 

acttaacget 600 

taccggcgct 660 

gcttt ccgcg 720 

ccagaacggc 780 

cagctggctc 840 

gcccggtgct 900 

tgagagtctg 960 

cgcatccgtc ■ 1020 

tatctacctg 1080 

gtcgttaatc 1140 

actgatggcg 1200 

aaacatttcc 1260 

aggtgaaatc 1320 

actgctgttt 1380 

ggtagegatt 1440 

gatgetcage 1500 

gttcgacagg 1560 

gctgacctta 1620 

tccgaaaggt 1680 

gcaatccagc 1740 

gcaggatccg 1800 

gctgaacagt 1860 

gcaaaaagtc 1920 

cctgcaacca 1980 

taccttgeag 2040 

actccagcaa 2100 

ggcgtatgtc 2160 

cgataacgcc 2220 

caaccagtat 2280 

ggataccatt 2340 

aattgagcag 2400 

catctccttt 24 60 

caccgaaaag 2520 

cgccttccag 2580 
gtatatcgtg . 2640 

gctacccacc 2700 

tgtgattgcg 2760 

gatgatcgac 2820 

ctaccaggct . 2880 

tggcgcgctg 2940 

tateggcatg 3000 

gatttatttg 3060 

agaggaggcg 3120 
3123 



<210> 123 
<211> 3078 
<212> DNA 
<213> E . Coli 



<400 
gtgaagtttt 
attaccctgt 
gattttcegg 
t cttccgttg 
acctccagca 
at caaeggeg 
agtgggatgc 
atcctcacgc 
cagctggctc 



> 123 
ttgccctctt 
geggcatact 
tgattatcgt 
ccacgccgct 
gttegctegg 
cagcgcgtga 
ccagccgccc 
tgaegtcega 
cgacgatttc 



catttaccgc 
gggcttccgt 
cagcgcctcg 
ggagegctea 
cagcacgcgt 
tgtgcaggcg 
gacctatcgc 
tacttattcg 
gcaaat cgac 



ccggtggcga 
atgetgeegg 
ctgcccggt g 
ettgggegea 
attattttgc 
gcgatcaacg 
aaagegaace 
cagggtgaac 
ggtgttggtg 



cqattttact 
tcgccccgct 
cgtcaccaga 
ttgccggagt 
agtttgattt 
ctgcacaaag 
cgtcggatgc 
tgtacgat tt 
atgt-cgatgt 



gtcggttgcc 
gccgcaggtc 
aacaatggcg 
cagtgaaatg 
tgacegggat 
tttgctgccc 
gecaattatg 
cgect cgacg 
eggaggcage 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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tcactgcccg ccgWCgcgt cgggctgaat ccgcaggcgc 
ctggacgacg tacgcaccgc cgtcagcaat gccaacgtgc 
gaagatggca ctcaccgctg gcagatccag accaatgatg 
tatcagccgt tgattattca ctacaacaac ggcggcgcgg 
acggtgaccg actcagtgca g'gatgtgcgc aacgccggga 
attttactga tgatccgcaa actgccggaa gccaatatta 
cgggcaaaat taccggagtt gcaggaaacc attccggcgg 
caggatcgct cccccaccat tcgcgcctcg ctggaagaag 
tcggtggcgc. tggtgattct ggtggtgtt t ttattcctgc 
attcccgccg tttcggtgcc ggtttcgctg attggtacgt 
ggattcagtc tcaataacct ttcgttaatg gcgctcacca 
gatgacgcca tcgtggtgct ggaaaacatt gcacgtcatc 
ttgcaagccg cactgcaagg tactcgcgaa gtcggtttta 
tcactggtgg cggtgttcct gccgctgctg ttgatgggcg 
cgcgaatttg ccgtgacgct ttctgtcgcc attggtatat 
ttaacgccaa tgatgtgtgg ctggatgctg aaagccagca 
ctgcgtggtt ttggtcgcat gttggtagcc ctgcaacaag 
tgggtgctca atcatacccg tctggtgggc gtggtgctgc 
atctggctgt a tatctcgat cccgaaaacc ttcttcccgg 
atgggcggga ttcaggcgga tcagagtatt tcgtttcagg 
gatttcatga aaattatccg tgacgatccg gcagtggata 
ggttcgcgag tgaacagcgg gatgatgttt atcaccctca 
gaaacggcgc agcaaattat cgaccgtctg cgcgtaaaac 
aatctgttcc tgatggcggt acaggatatt cgcgttggtg 
taccagtaca cgttgttatc cgacgacctg ■ gcggcactgc 
cgcaaaaaac tggcgacgtt gccggaactg gcggacgtga 
ggcgcggaga tgaatctggt ttacgaccgc gacaccatgg 
caagccgcca acagtctgtt aaataacgcc ttcggtcagc 
cagccgatga accagtataa agtggtgatg gaagtggatc 
agtgcgctgg aaaaaatgtt cgttatcaat aacgaaggca 
ttcgctaaat ggcaaccggc gaatgcccca ctatcggtga 
gcctcgacca ttt cgtttaa cctgccgacc ggaaaatcgc 
atcgatcgcg caatgaccca gcttggtgtg ccttcgacgg 
acggcgcagg tgttccagga gacgatgaac tcgcaggtga 
gccacggtgt atatcgtgct gggtatcctt tacgagagtt 
ctctccaccc tgccctcggc gggcgttgga gcgctgttgg 
ccgttcagcc taatcgccct gatagggatc atgctattaa 
gccattatga tggtcgattt tgcgcttgaa gcccaacggc 
gaagctattt tccaggcctg tctgctgcgt tttcgcccga 
gcgctgtttg gtgcgctgcc gctggtattg tcgggcggcg 
cccctgggga tcaccattgt cggcggactg gtaatgagcc 
acgccggtgg tgtatctctt tttcgaccgt ctgcggctgc 
caaacggtaa ccgagtaa 



tgtttaatca 

gtaaaccgca 

agctaaaaac 

ttcgtctggg 

tgaccaacgc 

tccagacggt 

cgattgatct 

tcgagcaaac 

gctcgggtcg 

tt gcggcgat 

tcgctactgg 

tggaagcggg 

cggtgctgtc 

ga 1 1 gccggg 

cgttgctggt 

agccgcgcga 

gctacggcaa 

ttggcaccat 

agcaggacac 

cgatgcgcgg 

atgtcaccgg 

agccacgcga 

tggcgaaaga 

ggcgtcagtc 

gagaatggga 

actccgatca 

cacggctggg 

ggcaaatctc 

cgcgctatac 

aagcgat ccc 

atcat caggg 

t ct cggacgc 

tgcgcggcag 

tcctgattat 

acgtacatcc 

cgctggagct 

t cggcat cgt 

acggtaacct 

t tatga tgac 

acggctcgga 

agctccttac 

gtttttcgcg 



gggcgtgtcg 

gggcgcgctg 

cgccgctgaa 

cgatgtggcg 

caaaccggct 

tgacagcatc 

gcaaattgcc 

gctgattatc 

cgccactatt 

gtacctgtgc 

tttcgtggtg 

aatgaaaccg 

gatgagtctg 

ccgactgtta 

ttctctigaca 

gcaaaagcga 

gtcactaaaa 

tgcgctgaat 

tggcgtgttg 

taagttgcag 

ctttacaggc 

cgaacgcagc' 

accgggggcg 

gaacgccagc 

gccgaaaatc 

gcaggataac 

aatcgacgt a 

gaccatttac 

ccaggacatc 

gctgtcgtat 

attatcggcg 

cagtgcggcg 

ttttgccggc 

cgccgccatc 

gctgacgatt 

gttcaatgcc 

gaagaaaaac 

gacgccgcag 

taccct ggcg 

gctgcggcaa 

gctgtatacc 

taaacctaaa 



600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
24 00 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3078 



<210> 124 
<211> 1416 
<212> DNA 
<213> E . Coli 



<400 
atgacagatc 
atgcagt cgc 
ggggaaagtc 
atgctgcccg 
gccatcgtgc 
ctgttgctgg 
. ttgacggtga 
ttacccggtc 
gcatcgtggc 
acattgctgt 
ttattgctgg 



> 124 • 
ttcccgacag 
tggacaccac 
cgtt gcatat 
ccagcggctg 
tgtttactct 
cacgcgcgtt 
tgaaaatcgt 
aggtcggtcc 
actggatctt 
taatgccgaa 
cggttggcat 



cacccgttgg 
catcgtaaac 
gcacatggtc 
gctggcggac 
cggt tcactg 
acagggcgtt 
accgcgcgag 
gctgctcggt 
tttgatcaac 
ctacaccatg 
ggcggtatta 



caat tgtgga 
accgcccttc 
attgtctctt 
aaagtcggcg 
ttttgcgcgc 
ggcggcgcga 
caat atatgg 
ccggcgct eg 
att ccggtgg 
cagacgcggc 
accctggcgc 



ttgtggcttt 
cctcaat ggc 
atgtgct gac 
tgegcaatat 
tttccggcac 
tgatggtgcc 
eggegatgae 
geggtctget 
ggattat egg 
gctttgatct 
tggacggcag 



eggett ct'tt 
gcaaagcctc 
cgtggcggtg 
tttctttacc 
gctgaacgaa 
ggteggcaga 
ctttgtcacg 
ggtggagtac 
tgegat cgee 
etceggattt 
taaaggtaca 



60 
1.2 0 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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ggtttatcgc cgctgacgat tgcaggcctg gtcgcagttg gcgtggtggc actggtgctt 720 

tatctgctgc acgccagaaa taacaaccgt gccctgttca gtctgaaact gttccgtact 780 

cgtacctttt cgctgggcct ggcggggagc tttgccggac gtattggcag tggcatgttg 840 

ccctttatga caccggtttt cctgcaaatt ggcctcggtt tctcgccgtt tcatgccgga 900 

ctgatgatga tcccgatggt gcttggcagc atgggaatga agcgaattgt ggtacaggtg 960 

gtgaatcgct ttggttatcg tcgggtactg gtagcgacca cgctgggt-ct gtcgctggtc 1020 

acccrgttgt ttatgactac cgccctgctg ggctqgtact: acgttttgcc gttcgtcctg 1080 

tttttacaag ggatggtcaa ctcgacgcgt ttctcctcca tgaacaccct gacgctgaaa 1140 

gatctcccgg acaatctggc gagcagcggc aacagcctgc tgtcgatgat tatgcaattg 1200 

tcgatgagta tcggcgtcac tatcgccggg ctgttgctgg gactttttgg ttcacagcat 1260 

gtcagcgtcg acagcggcac cacacaaacc gtctttatgt acacctggct tagcatggcg 1320 

ttgatcatcg cccttccggc gttcatcttt gccagagtgc cgaacgatac gcatcaaaat 1380 

gtagctattt cgcggcgaaa aaggagcgcg caatga 1416 

<210> 125 
<211> 1035 
<212> DNA 
<213> E . Coli 

<400> 125 

atggaaattc gcataatgct atttatatta atgatgatgg ttatgcctgt gagctatgcg 60 

gcatgttata gtgagttatc tgttcagcac aacttggttg ttcaggggga ttt t gcactt 120 

actcaaacac aaatggcgac atatgagcat aattttaatg attcgtcatg cgtaagtaca 180 

aatactatca cccctatgag cccgtcggat attattgttg gactttataa cgataccata 240 

aaattaaatt tacattttga atggaccaat aaaaacaaca tcacgttgtc aaataatcag 300 

accagtttca ccagtggtta ttcagttacg gtgacacctg cggccagtaa tgcaaaagtg 360 

aatgtttctg cggggggcgg cggttcagtg atgattaatg gtgttgcgac attatccagt 420 

gcttcatcat cgacacgcgg gagtgccgca gtacaatttc tactgtgttt attaggtggc 480 

aagtcatggg atg.catgtgt aaatagctac agaaatgcat tggcacaaaa tgcaggtgtc 540 

tattccttta atctgacatt gtcatacaac ccgataacca caacctgcaa accggacgat 600 

ttattaatta ctttagacag tattcccgtt tcacaattac cagccacagg taacaaagca 660 

acaataaata gtaaaca.agg ggatattatt ctgcgttgta aaaatttatt aggtcaacaa 720 

aatcaaacat cacggaaaat gcaggtgtat ttatcaagtt ctgacttgtt aaccaacagc 780 

aacacaatac tgaaaggtgc ggaagataat ggcgtaggat ttattcttga aagtaatggt 840 

tcgccagtca cacttttaaa tatcactaac agcagtaaag gatatacaaa tttaaaggaa 900 

gttgcggcga agtcaaaact tacagataca acggtttcaa ttccgataac agccagttac 960 

. tacgtctacg atacaaacaa agttaaatct ggcgcactgg aggcaaccgc attaatcaac 1020 

gtgaaatacg actaa 1035 

<210> 126 
<211> 2481 
<212> DNA 
<213> E. Coli 

<400> 126 

atgttgagaa tgaccccact tgcatcagca atcgtagcgt tattgctcgg cattgaagct 60 

tatgcagctg aagaaacctt tgatacccat tttatgatag gtggaatgaa agaccagcag 120 

gttgcaaata ttcgtcttga tgataatcaa cccttaccgg ggcagtatga catcgatatt 180 

tatgtcaata agcaatggcg. cgggaaatat gagattattg ttaaagacaa cccgcaagaa 240 

acatgtttat caagagaagt tatcaagcgg ttaggcatta atagcgataa cttcgccagc 300 

ggtaagcaat gtttaacatt tgagcaactt gttcagggtg ggagctatac ctgggatatc 360 

ggggtttttc gtctcgattt cagtgtcccg caggcctggg tggaagaact ggaaagtggc 420 

tatgttccac cggaaaactg ggagcggggt attaatgcgt tttatacctc ttattatctg 480 

agtcagtatt acagcgacta taaagcgtcg ggtaataaca agagtacata tgtacgtttt 540 

aacagcgggt taaatttact ggggtggcaa ctgcattctg atgccagttt cagtaaaaca 600 

aataacaatc caggggtgtg gaaaagcaat accctgtatc tggaacgtgg atttgcccaa 660 

cttctcggca cgcttcgcgt gggtgatatg tacacatcaa gcgatatttt tgattctgtt 720 

cgcttcagag gtgtgcggtt gtttcgtgat .atgcagatgt tgcctaactc gaaacaaaat 780 

tttacgccac gggtgcaggg gattgctcag agtaacgcgc tggtaactat tgaacagaat 840 

ggttttgtgg tttatcagaa agaggttcct cctggcccgt tcgcgattac agatttgcag 900 

ttggccggtg gtggagcaga tcttgatgtc agcgtgaaag aggcggacgg ctcggtaacc 960 
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acctatctgg 

gat ttagcgg 

gcgggttatc 

aataattatt 

gtcgatgcca 

tatcaaattg 

tzggcgttatt 

gataattatc- 

tttggccgca 

t ctgtgtcat 

tatcagttga 

gcttatgacg 

gattggggtg 

tttgatgatc 

gatcagttca 

ggggcgaatt 

agtacttatc 

gttaatctgg 

gatgcttatg 

gacggaatga 

gaagcagaat 

gttaattttg 

caatcattaa 

ggccagggaa 

attgataagc 

agaaattata 



tgccttatgc 
cgggtcgtag 
agtatggttt 
acgcgtttac 
ctaagtcgca 
cctacaacaa 
cgtcgcgtga 
gccgtgatga 
aaaatagctt 
taagtacgtt 
gttattccaa 
agaatcatca 
atgacgtt tc 
aggggtttgc 
attatggtgt 
tgacctggaa 
gacaggctgg 
cgaaccgtct 
tcaatgggca 
caccttatcg 
tacgtggcaa 
ataccgatca 
cgtttggtta 
gtcagttatt 
aacaaggact 
tttgceagta 



agcggtgcca 
ccatat tgaa 
taataattta 
tttgggggct 
tagtaaacaa 
atttgtgagc 
ttaccggaca 
aaacgatgt c 
ttccgccaat 
atggcgagat 
caacctgcga 
tgaagagaaa 
•gacgcctcgt 
ctcaaataat 
caacctgagt 
cgcgccggtt 
agccagtgt t 
ttccgaaacg 
aaaatatcgc 
ggaaaatcac 
ccggaaaatt 
gcgcaagcca 
tgaagtcaat 
tattcgcacc 
ttcatgcaca 
a 



aatatgctgc 
ggggcgagca 
ttgacgctgt 
ggctggaata 
gacaacggcg 
caaacgt cga 
tttaacgatc 
tatgacattg 
atgagccagt 
tactgggggc 
cggataagct 
cgttttaata 
cggcaaatat 
acgggattat 
catcaacatc 
gcgacagtga 
tcagggggca 
tttgct gtga 
acaacaaacc 
ctgatgctgg 
gccgcccctt 
tggtttataa 
gatat ccatg 
aatgaagtac 
atcaccttcg 



aacccggcgt 
aacaaagtga 
atggt ggctc 
cacgcattgg 
atgtgtttga 
cgcgttttgg 
acgtt tgggc 
ccgattatta 
cattgccaga 
gtagcggcag 
ataccctcgc 
tttttatatc 
atatgtctaa 
caggaacagt 
agggaaatga 
atggcagtta 
ttgtcgcctg 
tgaat gcgcc 
gtaat ggagt 
atgtgtcgca 
atcgcggcgc 
aagcgttaag 
gtcat aatat 
cgccat cggt 
gtaaagagat 



gtcgaaatat 
ttttgt ccag 
gatggtcgcg 
tgccatttcc 
cgggcaaagt 
tctggcggcc 
aaacaataaa 
ccagaacgat 
aggttggggg 
tagtaaggat 
ggcaagccag 
gattcccttt 
ctcaacgacg 
agggagtcgg 
aacgacagct 
tagtcagtcg 
gtcgggtggc 
aggaattaaa 
ggtgatatac 
aagcgatagc 
ggttgtactg 
agcagat.ggg 
tggcgttgtc 
taatgtggca 
tgatgaaagt 



1020 

1080 

1140 

1200 

12 60 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460. 

2481 



<210> 127 
<211> 720 
<212> DNA 
<213> E . ■ Coli 



<400> 
at ggccgcta 
tctttactca 
cgcattattt 
cgttcttcgc 
attcaggttc 
caagtaaaaa 
ctgaatgttc 
tttgcgatgc 
aaagcgacat 
gactcagcta 
gaaactat ta 
aataactggc 



127 
tcccatggcg 
ttttttctat 
acccggcaga 
tgctgcaggc 
ctttcatgtt 
tcaaaattat 
tggatattcc 
aaaacagaat 
ttaaaaaatt 
attgggtgac 
tgattgcccc 
atctgaccat 



gccttttaat 

ggtccttcct 

aaataaagaa 

gtggattgat 

aacgccacca 

gccgaataaa 

accaaat agt 

taagttgttt . 

gctggtaaat 

gat-ttcggat 

cttagaaagt 

tatcgatgac 



tt aagaggca 
gcacatgccg 
gtgatggtgc 
gatggcgata 
gtggcaaaaa 
ctgcccacta 
ccagagcaag 
t accggccag 
cgcagtggca 
gtcaaagcta 
cagagtgtta 
catggcaact 



t taaaatgaa 
gaattgttat 
agttgatgaa 
cgtcattacc 
taggggcaaa 
ataaagaaag 
aaggt aagaa 
cgggtattgc 
atggttt ggt 
ataatgtcaa 
atgtcaaaag 
atattagtga 



aggattatta 
ctacgggacg 
ccagggaaac 
accagaaaaa 
ttccgggcag 
cattttttat 
tgcact gaag 
tccggtaaat 
gataaaaaat 
agtcaattat 
taataatgca 
caaaatttaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



<210> 128 
<211> 543 
<212> DNA 
<213> E . Coli 



<400 
atgaaacgtt 
tttgctgcag 
tgtaaattcg 
gtctttgaag 
atgttaaaaa 
gataaaataa 
ctctatttag 
tataaaacag 
gataactcag 



> 128 
caattattgc 
acgttgatac 
aagcgggtgg 
gtaaagctaa 
ttagctgccc 
ccggtaacga 
gtgataacag 
cggaaggtca 
tgcaatcagg 



tgccgctgtc 
cggaacatta 
tgattcagta 
atattctacc 
gaaagaagt t 
taaggcgata 
cgatgtcctg 
atatgctatt 
tgatgtgtt a 



ttttcttctt 
actattaagg 
agtattaata 
tatgatgatg 
gctggtgtaa 
gccagtagca 
gatgtttctg 
ccgtt taaag 
tcttctctgg 



tttttatgag 


cgctggagta 


60 


ggaatattgc 


agaatctccg 


120 


tgccgactgt 


accaaccagt 


180 


cagtcggtgt 


aaccagcagc 


240 


aactctcgtt 


igattaccaac 


• 300 


acgataccgt- 


gggttactat 


360 


caccttttaa 


cattgagagt 


420 


caaaatacct 


gaaactgaca 


480 


ttatgcgtgt 


ggcgcaggat 


540 
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<210> 129 
<211> 339 
<212> DNA 
<213> E. Coli 

<400> 129 

atgagttcag agcgagatct ggttaatttt cttggcgatt tttcaatgga tgtggccaaa 

gcagttatag ccggtggtgt tgcaaccgct attggaagtc tggcttcttt tgcctgtgtt 

agctttggct ttccagtaat tcttgtcgga ' ggagcaattt tactgacagg gatagtgtgt 

acagttgttt taaatgaaat cgatgctcaa tgccatttat cagaaaaatt aaaatatgca 

attagagatg gactaaaacg gcaacaggaa cttgataaat ggaaaaggga aaacatgact 

ccatttatgt atgttcttaa cactccaccc gtgatatga 

<210> 130 
<211> 582 
<212> DNA 
<213> E. Coli 



<400 
a tgactgact 
aagtttctcg 
atggggctgg 
acgtggattt 
attgctgtgg 
cgtctgctgg 
tt get gaata 
gctgtcggtt 
gctgatgtcc 
tctctggcct 



> . 130 
acctgttact 
gtctctgtcc 
caacaaegtt 
tgatcccact 
tcgtgcagtt 
ggattttttt 
teaatctegg 
t etcget ggt 
cggcaccttt 
ttatgggctt 



gtt tgt egga 
gttta tgggg 
tgtgatgacg 
taatctgatt 
caccgagat g 
gecget tatc 
gcacaattt c 
gatggtgctc 
tegeggtaat 
tagtggtttg 



<210> 131 
<211> 579 
<212> DNA 
<213> E . Coli 



<400 
a tgaatgeta 
attctgggtt 
gacgaaatct 
g.cggaagcca 
gtgatgctaa 
caagagataa 
actaaatgta 
aeggtaatga 
tgcatctcgt 
attcccgtgc 



> 131 
tctggatt gc 
atgcctcccg 
tacegcagag 
tcagctgtaa 
aaattgeega 
cgcctgcgcg 
ttcaggcgtg 
gtgatct ctg 
tgcaaccggt 
gtatcattcc 



<210> 132 
<211> 2223 . 
<212> DNA 
<213> E . Coli 



actgtact gg 
gtt tccaaaa 
ctggcgtcta 
tacctgcgca 
gtggtgcgca 
accaccaact 
ttgeagtegg 
ttcgccgcca 
gecattgegt 
gtgaagttgt 



tcaa taact t 
aactggaaac 
t ttgcgcctg 
ccctggcatt 
aaaccagccc 
gtgcagtgct 
cget gtaegg 
t ccgcgaacg 
t aattacege 
aa 



tgtactggtc 
cgcgatgggc 
gcttat cgat 
tattct ggtg 
ggtgctttac 
cggcgtggcg 
tttttccgcc 
ccttgctgtg 
aggtcttatg 



cgtt gccgcc 
ccgt tt tgcg 
ccagtgtggt 
cggtgaaaaa 
gttgcttaat 
gatggtggcg 
t ceggtagae 
tacgggctgc 
cgcagaaaca 
cgtggaacac 



gtgagcctgc 
gt ggaagacg 
cagtgcggtt 
ateaacegtt 
gtcgagccgc 
gttattgatg 
gccatcgttg 
aatttatgtg 
cctgactcct 
catgettaa 



tgggcctggc 
ateeggtegt 
atcccggctg 
gcgccccagg' 
agccgctgga 
aaaat aactg 
gcgctacccg 
ttgatccgtg 
ggaaatggga 



gtttggcgcc 
tgagaaaatt 
tcgcccctac 
tggegaaget 
tggcgaagcg 
tattggctgc 
tgecatgeat 
cccgacgcac 
tctgaacacc 



60 
120 
180 
240 
300 
339 



60 
120 
180 
240 
300 
360 
420 
480 
540 
582 



60 
120 
180 
240 
300 
360 
420 
480 
540 
579 



<400> 132 
atgcttaagt tattctctgc 
catccaccgg agatgaaaac 
cagcgttttg ttattccact 
gteggegata aagtattgcg 
gttcacgcgc ccacctcggg 
tcagctttag ctgaattaag 



attcagaaaa 
ccagt ccaac 
gaaacagcat 
cggccagccg 
taccgttacg 
cgtgattatt 



aataaaat ct 
ggtacacccc 
attggcgctg 
cttacccgtg 
getattgege 
gatgecgatg 



gggatttcaa 
tgcgccaggt 
aaggtgagtt 
gtcgcggcaa 
cccactctac 
gtgaagactg 



cggcggcatc 
acccctggcg 
gtgcgttagc 
aatget gect 
ggctcatcct 
ctggat cccg 



60 
120 
180 
240 
300 
360 
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cgcgacggct gggcc^atta 

cagtttggtg ttgccgggct 

ggcggagata agattgaaac 

gccgatgacc gtttgatgca 

gcgcatattc tgcagccacg 

atttccatgc tgcgcgcggt 

accaaatatc- cttctggcgg 

ccacatggcg ggcgttcatc 

gcagtgaaac gtgccgttat 

ggcgaagcaa tcgctcgccc 

ttattgaatg atgccggatt 

ctaatgggct ttaccttgcc 

ttggctccct ctgccaatga 

agcgcctgtg ctgacgcctg 

aaaggtcagc aacacgataa 

gcttgcgcgt gggtttgccc 

gctgaaattg cggctattcg 

gaagcgcgcc aggctcgtct 

gcagccgttc aacctgcagc 

aaagagaaac aggcccaggc 

aacagtgcaa ttattgcagc 

gaactgcagc aaactaacga 

gcagctatcg cccgcgccaa 

gaacaacagg tcgatccgcg 

cgcaagctgg aacagcaaca 

gccgccgtcg aagccgctat 

aatgccgagc cagaacaaca 

cgagccaaag cgcgcaaacg 

gatccgcgca aagctgccgt 

cagcaacagg ctaatgcggt 

gcggctattg cccgcgctca 
taa 



tcgcactcgc 
gggcggtgca 
gttgattatc 
ggattgcgcg 
cgaaatt ctt 
gctggcggac 
tgctaaacaa 
cgat at cggc 
t gatggcgag 
gggcaacgtc 
ctgcccctct 
atggctggat 
acttggcgaa 
ccct gcggat 
agctaccacg 
gagcaatatt 
tcaggaagaa 
ggagcgcgaa 
caaagataaa 
tacacagcct 
acgggaagcc 
cgcagcaacc 
agcgcgcaag 
caaagccgcc 
ggctaatgcg 
tgcccgtgcc 
ggtcgatccg 
ggaacagcaa 
cgaagcggct 
accagaagaa 
ggccaaaaaa 



agtcgcgaag 
ggattcccga 
aacgcggctg 
gctcaggtcg 
atcggcattg 
tctaacgata 
teaacctaca 
gtat taatgc 
ccgattaccg 
tgggcacggc 
gccgatcaaa 
gtcccggtcg 
ccacaggaag 
cttttgccgc 
cataacattg 
cccctggtgc 
aagcgcgccg 
aaagcggctc 
gatgcgattg 
attgt gat ta 
cgt aaagcgc 
gttgctgatc 
ctggaacagc 
gtcgaagccg 
gaaccagaag 
aaagcacgca 
cgcaaagccg 
ccggctaatg 
attgcacgcg 
caggttgatc 
gccgcccagc 



agttaatcga 
caggcgttaa 
agtgcgagcc 
tagagggtat 
aagataacaa 
tttctctgcg 
ttctgaccgg 
aaaacgtcgg 
agcgtgttgt 
tggggacgcc 
tggtgattat 
taaagattac 
aacaaagctg 
aacagttgta 
ctgattgcat 
aatatttccg 
cagaagccaa 
gccttgaacg 
ctgccgctct 
aagcgggcga 
aagccagagc 
cacgt aaaac 
aacaggctaa 
ctattgcccg 
aacaggt cga 
agctggaaca 
ccgt cgaagc 
cggagccaga 
ccaaagcacg 
cgcgcaaagc 
agaaggttgt 



gcgcatacat 
attgcagggt 
gtacattacc 
tcgcattctt 
accgcaggcg 
ggtgat'tcca 
gaagcaggtt 
cactgcttat 
aaccctgact 
agtgcgtcat 
gggtggcccg 
caactgtctg 
catccggtgt 
ctggttcagc 
tgaatgtggg 
tcaggaaaaa 
agcgcgtttc 
acataagagc 
ggcgcgggtg 
acgcccggat 
gaaacaggca 
tgccgttgaa 
tgcggaacca 
tgccaaagcg 
tccgcgcaaa 
gcaacaggct 
cgctattgcc 
agaacaggtt 
caagctggaa 
ggcagttgcc 
aaacgaggac 



420 
480 
540 
600 
660 
720 
" 780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2223 



<210> 133 
<211> 1059 
<212> DNA 
<213> E. Coli 



<400> 133 

atggtattca gaatagctag ctccccttat acccataacc agcgccagac atcgcgcatt 60 

atgctgttgg tgttgctcgc agccgtgcca ggaatcgcag cgcaactgtg gttttttggt 120 

tggggtactc tcgttcagat cctgttggca tcggttagtg ctctgttagc cgaagctctc 180 

gtactcaaac tacgcaagca gtcggtagcc gcaacgttga aagataactc agcattgctg . 240 

acaggcttat tgctggcggt aagtattccc cccctcgcgc catggtggat ggtcgtgctg 300 

ggtacggtgt ttgcggtgat catcgctaaa cagttgtatg gcggtctggg acaaaacccg 360 

tttaatccgg caatgattgg ttatgtggtc ttactgatct ccttccccgt gcagatgacc 420 

agctggttac cgccacatga aattgcggtc aacatccctg gttttatcga cgccatccag 480 

gttattttta gcggtcatac cgccagtggt ggtgatatga acacactacg cttaggtatt 540 

gatggcatta gtcaggcgac accgctggat acatttaaaa cctctgtccg tgccggtcat 600 

tcggttgaac agattatgca atatccgatc tacagcggta. ttctggcggg cgctggttgg 660 

caatgggtaa atctcgcctg gctggctggc ggcgtatggt tgctatggca gaaagcgatt 720 

cgctggcata ttcccctcag cttcttagta acgctggcgt tatgcgcaat gttgggctgg 780 

ttgttctcac cagaaacact ggcagcaccg caaattcatc tgctgtctgg agcgaccatg 840 

ctcggcgcat tctttatttt gactgacccg gttaccgctt ctacgaccaa tcgtggtcgt 900 

cttattttcg gcgcgcttgc gggcttatta gtctggttga tccgcagttt cggcggctat 960 

cctgacggcg tggcttttgc cgtcctgctg gcgaacatca 'cggttcctct gatcgattac 1020 

tacacgcgtc cgcgcgtcta cggccatcgc aaagggtaa 1059 

<210> 134 
<211> 621 
<212> DNA 
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<400> 134 



atgctgaaaa ctatccgaaa acacggcatt acgttggcgc tatttgcagc gggttcaaca 

gggttaactg cggccatcaa ccagatgacc aaaacgacga ttgctgaaca ggccagtctg 

caacaaaagg cgttatttga tcaggtgctg ccagccgaac gctataacaa tgcgctggca 

cagagttgct atctggtaac tgcgccagag ttaggtaaag grgagcat eg . ggttt acatc 

gecaaacagg atgacaaacc ggtagccgcc gttctggaag caaccgcgcc agatggctat .duu 

teeggtgega ttcagctget ggtgggagcc gattttaacg geaeggtact tggcacgcgc 360 

gtgacagagc accacgaaac gecagggett ggcgataaaa tegaactgeg cctttctgac 420 

tggatcaccc attttgeggg taaaaaaatc agtggtgcag atgatgegea ctgggcggtg 

aagaaagatg gtggtgattt cgaccagttc accggcgcga cgattactcc ccgcgcggtg 

nt T Pi Pit nrnn tPiP\PiPirnrnr rnnst-1-nl-sr rr/-*i-<-a/-i=s^/-T+- 4 — _ 4- .i 4. 



60 
120 
180 
240 
300 



480 
540 
600 



-3 ^--33-"=» — ^ — — — — *,^ww a. k_ I, u L. l_ \_ ^ LL-Utyuyy LLj 

gttaatgegg taaaacgege eggattgtae getcagaegt taccggcaca actttctcaa 
cttcctgcct gtggagaata a 621 

<210> 135 
<211> 696 
<212> DNA 
<213> E . Coli 

<400> 135 

gtgagcgaaa ttaaagacgt tattgttcag gggttgtgga aaaacaactc tgcgctggtc 60 

cagttgeteg gcctttgtcc tctgttggcg gtcacgtcca ctgccactaa cgctctgggt 120 

ttaggacttg cgactacgct ggtactgacg ctgaccaacc tgaccatttc gaegctgegt 180 

cactggacgc cagecgagat ccgcattccc atttacgtga tgatcatege ctcggtggtc 240 

agcgctgtac.agatgctgat caacgcctac gcctttggcc tgtatcaatc attagggatt 300 

tttattccgC tgattgtcac taactgtatc gttgtgggcc gegctgaage cttcgccgcc 360 

aaaaaaggtc cggcgctttc ggcactggac ggcttttcaa ttggtatggg cgcaacctgc 420 

gccatgttcg tgctgggttc actacgegaa attateggea atggcacatt gtttgacggt 4 80 

geagatgegc. tgttaggtag ctgggcaaaa gtattacgcg tggagatttt ccacaccgac 540 

tcccctttcc tgctggcgat gctgccacca ggtgcattta ttggcctggg actgatgetg 600 

gcaggaaaat acctgattga tgaaagaatg aaaaagegee gtgetgaage agetgeagaa 660 

cgtgcattgc caaacggtga aacagggaat gtctga 696 

<210> 136 

<211> 636 

<212> DNA 

<213> E . Coli 

<400> 136 

atgaataaag caaaacgcct ggagatcctc actcgcctgc gtgagaacaa tcctcatccc 60 

accaccgagc ttaatttcag ttcgcctttt gaattgctga ttgeegtact gctttccgct 120 

caggcgaccg atgtcagtgt taataaggcg aeggegaaac tctacccggt ggegaatacg. - 180 

cctgcagcga tgcttgaact gggcgttgaa ggggtgaaaa cctatatcaa aacgattggg 240 

ctttataaca gcaaagcaga aaatatcatc aaaacctgcc gtatcttget ggagcagcat 300 

aatggcgagg ttceggaaga tcgtgctgcg cttgaagccc tgcccggcgt aggtegtaaa 360 

acagccaacg tegtattaaa cactgcattc ggctggccga etattgetgt cgacacgcac 420 

attttccgeg tttgtaatcg tactcaattt gcgccgggga aaaaegtega acaggtagaa 480 

gaaaagctac tgaaagtggt tccagcagag tttaaagtcg actgccacca ttggttgatc 540 

ctgcacgggc gttatacctg cattgcccgc aagccccgct gtggctcttg tattattgaa 600 

gatctttgtg aatacaaaga gaaagttgac atctga 636 

<210> 137 

<211> 504 

<212> DNA 

<213> E. Coli 

<400> 137 

atgaaaagac ttcacaagag gttcctgtta gctacgtttt gcgcgttatt cacagcaact 60 

ctccaggccg ccgatgtcac tatcactgtt aatggtcggg tagtegctaa accctgcact 120 

attcaaacca aagaagctaa cgttaatctc ggggatcttt atacgegcaa tetgeaacaa 180 
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cctggttctg catctggctg gcacaatatt actttgtcat taaccgattg tccggttgaa 240 

acaagtgcag tgacggcaat cgtgacaggt tcaactgaca atacgggtta ttacaaaaat 300 

gaaggtactg ccgaaaatat tcagatagag ctgagggatg accaggatgc tgcgttaaaa 360 

aatggcgata gcaaaacggt tattgttgat gagatcactc g.taatgcaca gtttccactt 420 

aaggcaagag ctatcacggt gaatggaaac gcaagccagg gaacgatcga ggcgctaatc 480 

aatgtgatct acacctggca ataa 504 



<210> 138 
<211> 531 
<212> DNA 
<213> E . Coli 



<400 
atgaaataca 
ttatccgcag 
tcatcggatt 
acgaccggta 
gggacaacgg 
ttgaaactgg 
aatatgcgtc 
ccagaacaga 
aatccgggac 



> 138 
at aacatt at 
atagcgttat 
cgcttaattt 
gcacaagtcc 
gggttcgggt 
atgaaggaag 
cggtgaaact 
acaatatttt 
tggtgagggc 



tttcctcggt 
taaaattagc 
taccgtagat 
agccgtccct 
tgcatttaac 
caatacggcc 
gaatgatctt 
gccttactcc 
ttcggcaacc 



ttatgtctgg 
gggcgcgtcc 
ctccaaaaaa 
tttcagat ta 
ggtattgagg 
tccggtttgg 
catgccggga 
gctcgtctga 
tttacccttg 



ggttaaccac 
tc'gattatgg 
acagtgccag 
cgttaagtga 
atgcaga'aaa 
gtatagaaat 
tgcagtggat 
agtcaactca 
aatttcaata 



ctattctgct 
ctgcacagt c 
acaatttcca 
atgcagcaaa 
taatactttg 
attggacgca 
cccactggta 
gaagtccgtc 
a 



60 
120 
180 
240 
300 
360 
420 
480 
531 



<210> 139 
<211> . 1149 
<212> DNA 
<213> E. Goli 



<400 
atgagtggtt 
gattatttta 
cttggaaatt 
cgcagcgacc 
'ctgaattaca 
acgcttaatg 
tcaaacgcca 
gtttatggca 
cacggaggt a 
tatggtaata 
atgagtggtg 
acaatggttc 
att cataccg 
cgtgttgctc 
actgtcat cc 
aaagtattaa 
cacggagaga 
cttccacagt 
gtcgagtaca 
tgtcgctaa 



> 139 
acaccgtcaa 
atctgttcta 
acggtacgac 
agcaaatatc 
gctattccaa 
ttcccttcag 
gttacagtat 
ctctgctgcc 
atacatcgtc 
ctaatgtcgg 
ggattattgc 
tggttaaggc 
actggcgt gg 
tt aacgcgaa 
caactcacgg 
tgacgttgaa 
at aaaaatgg 
cagggcaatt 
agcttcctga 



gcct cctacc 
cagtaagcgt 
atttttcagt 
atttggatta 
taatatatgg 
tcattggatg 
gtcaaacgat 
ggataataac 
tggcaccagt 
ttacagtcgg 
tcatgctgat 
tcctggtgct 
ct atgcca ta 
tt cccttgca 
t get at tgcc 
gtacggtaat 
cagcattgtc 
acaggtttca 
agtttctcct 



ggagacacca 
ggtcaggaac 
gccagtcgcc 
aatgtgccgt 
caaaacgatc 
cgtacagaca 
ttgaaaggcg 
ctgaattata 
ggttacagtt 
agtggtgaca 
ggcatcacct 
gataatgtca 
ttaccatttg 
gataatgttg 
agagcaacat 
aagagcgttc 
gcggaaaatg 
tggggcaaag 
ggtaccttac 



atgagcagac 

aaataagcat 

aaagttactg 1 

ttggtgatat 

gggatcattt 

gtcagtegge 

gcatgaccaa 

gcgttcaggt 

ctcttaat.ta 

geagecagat 

ttggacagcc 

aaat agagaa 

cgacagaata 

aactggatga 

ttaatgeaca 

cat t cggtgc 

gtcaggttta 

ataaaaactc 

tgaaccagca 



acaatttatt 
ctctcagcag 
gaacaegtea 
tacgacttcg 
actegctttt 
atttegtaat 
tctat egggg 
cggtaacacc 
tcgtggagct 
ttattacgga 
getgggegae 
ccagaccgga 
tagagaaaac 
aaccgtggtc 
aatcggcggg 
aattgtcaca 
tctgactgga 
aaactgtatt 
gacagcaatc 



60 
120 
180 
240 
300 
360 
4 20 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1149 



<210> 140 

<211> 417 

<212> DNA 

<213> E . Coli 

<400> 140 

atgattgega ttgecgacat ettgeaagea ggagaaaagc taactgctgt ggcacctttt 60 

ctggcgggta ttcagaacga ggaacaatac acccaggcgc tggaactggt ■ agatcatctg 120 

ctgctcaacg atcctgaaaa ccccttgctg gatctggtgt gtgccaaaat aaccgcgtgg 180 

gaagaatcag cgcccgaatt tgcggaattt aatgccatgg ctcaagccat gcctggcggt 240 

atagccgtga ttcgtaccct tatggatcaa tatggtttaa ccctttccga tetgeeggaa 300 
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attggcagta aatctatggt gtcacgcgtt ttgagcggga agaggaaatt aacgctggaa 
cacgctaaaa aattggcaac gcgattcggc atttctcccg ccttgtttat tgattaa 

<210> 141 
<211> 315 
<212> DNA 
<213> E. Coli 



360 
417 



aaaaataccc gcaacataaa 
at ttcaaaaa acctgagtca 
tggataagca ttatgttttc 



<400> 141 • 

atgcacctga taactcaaaa agcattgaaa gatgctgcgg 

acggagttgg tggctctggg gaacacgatt gctaagggat 

ttaaaagcag tattcccatc tctggataae ttcaaatatc 

aatgttgggg gcaatgaatt acgtgttgta gcaatggtct tttttgaatc gcaaaagtgc 

tacatacgtg aagttatgac gcataaagaa tacgatttct ttaccgctgt tcatcgtact 
aaggggaaaa aatga 

<210> 142 
<211> 7152 
<212> DNA 
<213> E. Coli 



60 
120 
180 
240 
300 
315 



<400 
ttgctatcag 
tcagggaagg 
atattatgcg 
caacttgcgt 
ccagttcctg 
ttggaatcgg 
ctcaaccagt 
gatgtcccgg 
aacctcgagc 
atgaacagcg 
gcaatgacag 
gattttagcc 
aatctctttt 
ggcttaggtt 
cacgatctta 
ct aaaattaa 
gacaacgatt 
cccgcctggc 
gccctgttcg 
tataccccct 
aat gacaccc 
cttgacccga 
gat cgcaaca 
acagaccccg 
aaatatgccc 
gtggtcacaa 
ccagaaaccg 
tcgaatcgtg 
tcggtatcgt 
tttattgcgc 
gaaggtgttc 
cagatcctga 
gtggatgcgg 
cactcgtcaa 
gtagaactga 
gcagtcagca 
ggcgtctata 
ttaatgcaaa 
cagtcagcga 



> 142 

tatttacatt 
taagcatggc 
ggat gggaat 
tccctatggc 
cacaaattgc 
cccaaagcgt 
ttcgtacgtt 
cacaagt t ag 
aacagatagc 
agcaagcggc 
actggttaag" 
tgaagaactc 
t cagtcagca 
ggcgtcattt 
gccgttacca 
gcagtaacgg 
atgaagcacg 
cgcaccttgg 
ataaagacga 
tcccgctgat 
gttttgccgt 
atgaagtcgc 
acaatatcgt 
tgacagggaa 
tgaaaggcta 
cgggtaaaga 
ataacacctg 
aacagagcat 
taagtaccca 
atgatgcagc 
aggacatcac 
ccacaggagc 
ctaaagcccc 
t taagattga 
gagatgaaaa 
tcgacaacgt 
aggcgaccta 
actggaatga 
aaattgcgac 



ttttcgctgt 

tacgaagaag 

taaactacgc 

tgcggcagca 

cattgcaaat 

t gccgaacgt 

t gctcgaagt 

tgaaaaaaaa 

cagtactt ca 

aaatatggcg 

ccgcttcggt 

ccagt tcgat 

tactctccat 

cactcccaca 

ctcccgcgcc 

ctatttgcga 

cccggccaat 

cggtaaactg 

t cggcaaagt 

gaccttcagc 

cgattttacc 

tgcacggcgt 

t ctggaatat 

gt caggagaa 

t aacgtcgaa 

tattctggtt 

gccgattgaa 

ggtggtcgtt 

aacattgaac 

aggtaatcct 

ccttt ctgac 

gatgt ctggc 

cgccgtggtg 

taaggaccgt 

tgacaaacct 

gaaaccagga 

taccgcctat 

agatttgcat 

attatctgcc 



gctagaaagg 

agaagtggag 

cgcttaactg 

caaggtgtgg 

gccaatacgg 

ttcggtattt 

t ttgataatg 

ttaaccccgc 

cagcaaatcg 

cgtggatggg 

a ccgcaagaa 

tttctccatc 

cgtactgacg 

tggatgtcgg 

g.gcattggcg 

ctgaccaact 

ggctgggatg 

gtctatgaac 

aatcctcatg 

gcggagcaac 

tggcaacctg 

agccttgcag 

cgcaaaaaag 

gtgaaatcac 

gccaccgcac 

accctgccgg 

gtcaccgccg 

caggcaccta 

gcggattccc 

gttgtcgggc 

tggaaagata 

acgctgacgc 

aatatcattt 

tatctctccg 

gttaaggaac 

gtcactacag 

a ccaaaggca 

accgctggat 

agcaataatg 



gcgcatttat 

aagaaataaa 

cgggtatctg 

taaacgccgc 

tgccctacac 

cggtggctga 

tccgccaggg 

cgccgggtaa 

ggtctctgct 

cctcttctca 

tcacgctggg 

cgtggtatga 

agcgtacgca 

gcatcaactt 

cggagtactg 

ggcgcagcgc 

tacgcgcaga 

agtattatgg 

ccataaccgc 

gccagggt aa 

gcagcgcaat 

gcagccgt ta 

aactggtt eg 

tggttt'cgtc 

tggaagctgc 

ettaceggtt 

aagatgtcaa. 

cgctaagcca 

attcaaccgc 

tggtgctctc 

atggtgacgg 

tgatgecaca 

ctgtttcgtc 

gcaatcctat 

aaaaacagca 

actggaaaga 

gtggacttac 

ttatcatcga 

gtgtgctcgc 



gLLagctcgt 


60 


tgaccgacaa 


120 


LLLydLaaC L 


1 o U 


aacccaacaa 


2 4 0 


tut Lggagcg 




gttacgcaaa 


3 60 


Lya Lgaacig 


/ion 
4 ZO 


L-aycagLgac 


4 0 U 


f* r-i /~r o a /— t s +- 

LyL.Lyaay at 


D 4 U 


n n i~ +" e~* a py n r~* 
yy ^ l LLay yu 


ouu 


t-yuggaugaa 


a a n 
DDL) 


aaLyLuLya L. 


79A 
/ Z\J 


gattaacaac 


780 


ctttttcgac 


840 


gegegactat 


900 


acctgaactg 


960 


aagctggcta 


1020 


cgatgaagt-g 


1080 


tggacttaac 


1140 


acagggegaa 


1200 


gcagaaacag 


1260 


tgatctggtg 


1320 


cctgaccctg 


1380 


gctacaaacc 


1440 


cggtggcaaa 


1500 


caccagtacg 


1560 


aggcaatttg 


1620 


gaaagattcc 


1680 


cacactgact 


1740 


gacgcgtcac 


1800 


aagctatacc 


1860 


gctgaa tggt 


1920 


atcccgaact 


1980 


cgaggtgacg 


2040 


actgaataac 


2100 


aacegcagat 


2160 


tgegaagcta 


2220 


cgccaacccg 


2280 


caatgagaat 


2340 
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gcagcaaaca ccgtctcggt caatgtcgct gatgaaggaa gcaacccaat caa'.tgatcat 2400 

accgtcacgt ttgcggtatt aagcggatcg gcaacttcct tcaacaatca aaacaccgca 24 60 

aaaacggatg ttaatggtct ggcgactttt gatctgaaaa gtagtaagca ggaagacaac 2520 

acggttgaag tcacccttga aaatggcgtg aaacaaacgt t aatcgt c.ag ttttgtcggc 2580 

gactcgagta ctgcgcaggt tgatctgcag aagtcgaaaa atgaagtggt tgctgacggc 2640 

aatgacagcg tcacaatgac cgcgaccgtc cgggatgcaa aaggcaacct gctcaatgac 2700 

gtcatggtca ctttcaatgt taattcagca gaggcgaaac cgagccaaac cgaagtgaat 2760 

agccacgacg ggatcgccac agctacgctg accagtttga aaaatggtga ttatagggtt 2820 

.acggcctctg tgagctctgg ttcccaggct aatcaacagg tgaattttat cggtgatcaa 2880 

agtactgctg ccctgaccct cagtgtgcct tcaggtgata tcaccgtcac caacacagct 2940 

ccgcaatata tgactgcaac cttgcaggat aaaaatggca acccactaaa agataaagaa 3000 

atcaccttct ctgtgccaaa cgacgtcgca agtaagttct cgattagcaa cggaggaaaa 3060 

ggcatgacgg atagtaacgg ggttgcaatc gcctccctga ccggcacgtt agcgggcacg 3120 

catatgatca tggctcgtct ggctaacagc aatgtcagcg atgcacagce aatgacgttt 3180 

gtggcggata aagacagagc ggttgtcgtt ttgcaaacat cgaaagcgga aatcattggg 3240 

aatggcgtgg atgagacaac tctgacagca acagtgaaag atccgtcgaa tcatccggtg 3300 

gcggggataa cggtaaactt caccatgcca caggacgttg cggcaaactt tacccttgaa 3360 

aataacggta ttgccatcac tcaggccaat ggggaagcgc atgtcacgct gaaaggtaaa 3420 

aaagcgggca cgcatacggt taccgcaacg ctgggtaata acaataccag tgattcgcag 3480 

ccggtaacat ttgtggcgga caaagcctcg gctcaggttg tcctgcagat atcaaaagat 3540 

gagatcacag -gtaatggcgt cgatagcgca acgctaactg caacggttaa agatcagttc 3600 

. gacaatgagg tgaataatct tccggtaaca ttcagctcag cctcttcagg actcaccctg 3660 

accccgggag taagtaatac caacgagtct ggcatcgcgc aggccactct cgcaggcgtt 3720 

gcctttggtg agaagacggt tactgcatca ctggctaata^ atggtgccag cgacaacaaa 3780 

actgtgcatt ttattggcga cacagcggcg gcaaaaatta tcgagttggc gcctgtccca 3840 

gacagcataa tcgccggtac cccgcagaac agctccggca gcgtcatcac cgccacagtc 3900 

gttgataata atggctttcc ggtgaaaggt gtgactgtga acttcaccag caacgcagcg 3960 

acagccgaaa tgacgaacgg cggtcaagcc gtgacgaacg aacagggtaa ggctaccgtc 4020 

acttatacca atacccgctc ctcgatagaa tcaggagcga gaccggatac cgttgaggcc 4080 

agtctggaaa atggtagctc cacgcttagc acatcaatta atgtcaacgc tgatgcgtct 4140 

acggcacatc tcaccttgct acaggcactt tttgatacag tctccgcagg cgagacaacc 4200 

agtctgtata ttgaggtgaa ggataattac ggcaacggtg tcccccagca ggaggtaacc 4260 

ctcagcgttt caccaagtga aggcgtgacc cccagtaata acgctatata tactaccaac 4320 

cacgacggca atttttacgc aagctttacc gctacaaaag ccggggttta tcaattgacg 4380 

gcaaccctcg aaaatggcga ttcgatgcaa caaacagtga cctatgtgcc gaacgtcgcg 44 40 

aatgctgaaa tcacgctggc agcctcgaag gatccggtga ttgccgacaa taacgatctc 4500 

acgacactaa cagcaacagt cgctgataca gagggcaatg cgatagccaa cactgaggta 4560 

acatttactc tgccggaaga tgtgaaggcg aacttcacgc tgagcgatgg cggtaaagtg 4620 

attactgatg ctgaaggcaa agcgaaagtc acgctgaaag gtacaaaagc aggcgctcat 4680 

actgttacag catcgatgac tggcggtaag agtgagcagt tggtggtgaa ctttattgcg 4740 

gatacgctca ctgcgcaggt taatcttaac gttaccgagg acaattttat cgctaataac 4800 

gtcgggatga ccaggctgca ggcaacagtg actgatggaa acggcaaccc gttagccaat 4860 

gaggcggtga cattcacgct accggcagat gtgagcgcaa gctttactct cggacaaggc 4 920 

ggttccgcca ttactgatat caacggcaag gctgaagtta cactgagcgg tacaaaatcc 4 980 

ggcacctacc ccgtgacagt tagcgtgaac aattatggtg- tcagtgatac gaaacaggtg 5040 

actttgattg ccgatgctgg taccgcaaaa ctagcctcct taacctctgt atactcattc 5100 

gtcgtcagca cgaccgaggg cgcaaccatg acggcaagcg tcactgacgc taacggcaac 5160 

ccggtagaag gcataaaagt taatttccgc ggaacctccg tcacgctaag cagcaccagc 5220 

gttgaaacgg atgatcgggg tttcgctgaa attcttgtga caagcaccga ggtcggactg 5280 

aaaacagttt cagcctctct ggcagataaa cctactgaag tcatctcgcg attactgaat 5340 

gccagtgcag atgttaattc tgcgacgatt accagtctgg agataccgga aggtcaggta 5400 

atggtcgcac aagacgtagc agttaaagct cacgttaacg accagtttgg caacccggtt 5460 

gcgcatcaac ccgtgacatt cagtgcagag ccatcctcgc aaatgatcat cagccagaat 5520 

acggtctcta ctaatacgca gggtgtagcc gaggtcacca tgacgcccga aagaaacggt 5580 

tcgtatatgg tgaaagcatc cctgccgaat ggagcctcac ttgagaaaca actggaggct 5640 

attgatgaaa aactgacact cacggcgtcc agtccgctta tcggtgtcta tgcccctaca 5700 

ggcgctactc tgacggcaac gctaacctct gcaaatggca ctccagtgga gggtcaggtc 5760 

atcaacttta gcgtaacgcc agaaggggcg acgttaagtg gcggaaaagt gagaactaac 5820 

tcttcaggtc aggctccagt cgttttgacc agcaataaag tcggtacata tacggtgact 5880 

gcatctttcc ataacggcgt aacaatacag acacagacaa ccgtgaaagt cactggcaac 5940 

tcaagcaccg cccatgttgc tagctttatc gctgatccat cgactatcgc cgccaccaac 6000 
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actgatttaa etacctTaaa ggcaacggtt gaggatggca 
ctcactgtgt acttcgcctt aaaaagcggc tctgccacat 
accgatcaaa acggaatcgc gacaacaagc gtgaaaggag 
gtaagcgcag tcacgaccgc tggtggaatg caaacagtag 
ccggcagaca cctcgcagtc cgtccttaag agcaatcggt 
accgatagtg ctgaattacg tcttgttctg cacgatatat 
tctgaaggga tggaatttgt gcaatcaggt actaacgtgc 
attgattaca gtctaaatat caacggtgat tacaaagcca 
ggtatcgcaa cgctgatccc tgtattgaat ggtgttcatc 
atacaattca ctcgcgcaga agaca'aaata atgagcggta 
gacctaccga caactacatt cccttcgcag gggttcaccg 
aatgacaact ttgccccagg aaaaacggcg gctgattatg 
tgggtcgatg ttgatgctac cggtaaagtg acatttaaaa 
aggattacgg cgacgccaaa atcaggaggc cctagctatg 
agttggtggg tgaacgccgg cgaggctttc atgatataca 
agcagcaatg gctacacgct ccccagagca aactatttaa 
atcgggtcac tgtacagtga atggggagat atggggcatt 
caatcaaata tgtattggtc atctagtccc gcaaactcaa 
ctggcaacag gtgatcaaag cgtatttgaa aagcttgggt 
aaaaacctgt ga 
<210> 143 

. <211> 186 
<212> DNA 
<213> E . Coli 



gtggtaacct 
taacgtcatt 
cgatgacagg 
atataacgct 
cat cact gaa 
caggcaatcc 
cctatat aaa 
ctgttacagg 
aagctggtct 
cagtatcagt 
gggcgtatta 
agttttcaag 
atgtcggcag 
.tatacgaaat 
gccttgctga 
accactgtag 
acacgactga 
gcgaacaata 
ttgcttatgc 



gatcgaaggt 
aacagcggtg 
tagcgtcacg 
ggtggctggc 
aggggactat 
gatcaaagtt 
aattagcgca 
aggcggagag 
gagtaccaca 
caatggtact 
tcagttgaat 
ctctgcctcc 
caatt cggaa 
ccgtgtgaag 
aaatttttgc 
ttcccgaggc 
cgctggtttt 
cgtagtttcc 
gacatgttat 



6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
684 0 
6900 
6960 
7020 
7080 
7140 



7152 



<400> 143 

atgagcaaag gcgcattata tgaatttaac aatccagatc aactgaaaat acctctccct 
cataaacaca tagcgtcaac attcaatgac ataatgagta aagatgttgg ttatgcatac 
gtatcattac tctatgcctg tcccttaaaa acccactcat taagactgaa tccattcagc 
aaatga 

<210> 144 
<211> .1197 
<212> DNA 
<213> E. Coli 



60 
120 
180 
186 



<400 
atgcaggtgg 
caggatggtc 
ggcttaatgg 
accaacggta 
gcggaggtta 
cgccaat tgc 
ttaaacacgg 
cagctatatc 
agcaaaaccc 
ttaagcgcac 
aaagtggcaa 
ttgcaggcgg 
gcattttatc 
gatctgtt cc 
ttcgatagtg 
atcgccagct 
caggttcaga 
cgtgtggtag 
.agcgaagcca 
tggctggatg 



> 144 
ctgaacagcg 
cgcagatcga 
ggccgtttgc 
cttttggctt 
ctgcccgcct 
tggctggcag 
tcttgcagca 
agaacgggat 
ggcagcagct 
ttacaaataa 
gccagcttcc 
cgcactggta 
ctgacatcaa 
gtcattccgc 
gtcgtcttaa 
acaacaaagc 
cactggcgga 
gtcttgcgca 
gaat ccccgc 
cctccattca 



cattcagcta 
ct tttcggcg 
tctgaacgat 
aacggcgggc 
gggt acggtt 
cgtagcccgc 
aatagaaaaa 
tacttcttca 
caacgatgtc 
ccagacaaag 
tgatgaactg 
cgttgagtca 
cctgatggcc 
gcagcaaatg 
cgccaatctc 
ggtggttgaa 
gaaaaaccag 
ggcgcgcttt 
gctgcgtgag 
actcactggt 



gctgaagccc 
gatatggagc 
ccggccgcag 
tggcatctcg 
aaagcacggg 
ctgtactggg 
gagcagaaca 
gttgaaggtg 
gcggggaaaa 
tcattgaagc 
gggtactcct 
tcgctaagca 
ttcctgcaac 
ggcgttacgg 
gatatcgcaa 
gcggtgaatg 
catcaggcgc 
aacgcgggca 
cgggccaatg 
gcgttgggcg 



aggcgaaggc 
ggcaaaaaat 
gtacgaccgg 
atatctgggg 
cggcggaac'g 
agtggcaaac 
ccattatcgc 
tggaaaccga 
tgaaaattat 
ttaaaccggt 
tact ggcccg 
ccattgatgc 
aggatgcgtt 
caggcctgac 
aagccgaaag 
acgtggcgcg 
aaat tgagcg 
teat tgctgg 
gcctgttatt 
gggggtacaa 



agttgccact 
gtcggcagaa 
cccgtggtac 
aaagaat egg 
cgagcaaacc 
ccaggcggcg 
gaccgat cgc 
tattaatgee 
tgaggcaegg 
cgcgttgccg 
gegggcagat 
ggcaaaagcg 
gcacttaagc 
gctacccatt 
caacttgtct 
ggcagccagt 
egatgecttg 
ttcccgcgtc 
gcaagggcag 
aegctga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
' 720 
780 
840 
900 
960 
1020 
1080 
114 , 0' 
1197 



<210> 145 
<211> 291 
<212> DNA 
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<400> 145 

atgtattgcc acgcgaaact aaaaaatata tcgcaacaca cggtaatctc cgcgcacctt 

ttcttacctg attattcccc catgaatcgt gattcctttt atccagccat cgcctgtttt 

ccgctgttac tgatgctggc cgggtgtgcg cctatgcatg aaacccgcca ggcgttaagc 

cagcaaacgc ccgctgcaca agttgacacc gcattaccca cggcgctgaa aatggttggc 

cagacagcca atggtggctg gagtatcacg ataatcaact cacttcctta a 

<21.0> 14 6 

. <211> 948 

<212> DNA 

<213> E. Coli 



60 
120 
180 
240 
291 



<400 
atgcgtgtgt 
accgaagtta 
ctgccggtaa 
tctggracgc 
gccgcccgtg 
aaaacggtta 
cagggtgcaa 
cgtctgggct 
ggcgctacgg 
attgactggc 
ggtgaaatct 
gtgatgattg 
gaaccgcgaa 
aagcagggcg 
cgtaaagaat 
cctgatattg 



> 146 
tactggcacc 
acgactacga 
aagtctttca 
tggtgcgcgt 
cggtggagtt 
acggtagcgg 
aagcgatgcg 
gggacagcgg 
agctggtggt 
aggcgattgg 
gggacrggca 
gtcgcggggc 
t gccgtggcc 
ataccgggtt 
acgat gaagc 
caagggctat 



gatggaggga 
tctgtgcatc 
tcgcat ttgc 
gcagttgtta 
aggttcctgg 
cggcggggcg 
tgaagctgta 
t gagaagaaa 
gcatgggcgg 
cgatat tcgc 
gagcgcgcaa 
gctcaat att 
ggaggtggtt 
atat cacgtt 
aacggaatt a 
tcaggcaatt 



gtgcttgact 
accgagtttg 
cctgagctac 
ggtcagttcc 
ggcgtggatc 
acgttactca 
ccggcgcatt 
tttgaaatcg 
acgaaagagc 
cagcggctga 
caatgcatgg 
cccaacctga 
gctttgctgc 
gcgcggatta 
tttcagcatg 
gatatcgaga 



ct ctggtgcg 
tccgcgtggt 
aaaacgccag 
cacaa tggct 
tcaattgcgg 
aagatcctga 
tgcccgtcag 
ccgatgcggt 
agggttaccg 
atattccggt 
cgatcagcgg 
gccgggtggt 
aaaaatatac 
aacagtggtt 
ttcgggtgtt 
aactctaa 



tgaattgctg 
ggatcaactg 
ccggacacca 
ggcagagaac 
ctgcccgt eg 
actcat ctac 
cgtgaaagtg 
tcaacaggct 
cgeggagcat 
gattgecaac 
ctgcgacgca 
aaaatataac 
ccgtctggaa 
gagttatttg 
gaataattcc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
948 



<210> 147 
<211> 891 
<212> DNA 
<213> E . Coli 



<400 
atgacaatat 
ccagacaccg 
gatttaacga 
tattccgacc 
atagagcacc 
gcggcaatgc 
ctgttttatc 
gccgagcctg 
gtggtgccgg 
aaacatattc 
acagggaa.ca 
gatgcccagc 
gagaaactga 
gaagaagece 
.atgatggtga 



> 147 
cgacaacttc 
cgcgggattt 
cgcttaaact 
t ct tgtggtc 
aaagtaagee 
aaaaccatct 
atggttgcag 
ctatagcccg 
atgacgagat 
gtcagegega 
ctaatgacag 
gtt ttcgtgc 
tgaccattgc 
tgcgtattgc 
cccgactttc 



cacgccgcat 
tattgatatt 
ggaaccaaac 
tgt gaaaacg 
ggaagaatta 
tgatgeggge 
aagtccttat 
caaaatatat 
tatgeaacac 
tctgt tggga 
acagctaaaa 
atttattggt 
tgacagatta 
tcaggagatg 
accagacgat 



gatgeggtat 
catctt cccg 
agttttattg 
caggagggag 
atggcttttc 
tat aaagagc 
ccttattcac 
teateggett 
cgcaaaatgg 
ttagtcgacc 
gccctgttta 
gagatagegg 
cgtgaagaag 
ctggatagag 
cttatcgcgc 



ttaaatcttt 
cgccgctgcg 
atgaagacct 
tgggttatat 
geatgatgeg 
tt ccat tggt 
tetgetgget 
t tccgttggt 
cget gttgga 
aaattgtttc 
attaegtatt 
aacgcgcacc 
gegcaatgea 
gtttagacag 
aaagecacta 



tttacgecat 
caaact gtgt 
geggcaatat 
ttatgtagtg 
ttattccatt 
gctcccgatg 
tgatgaattt 
ggatattacc 
gttaattcag 
getgetagtt 
acaaacaggg 
acaagaaaag 
gggcaaacac 
agagttagtt 
a 



<210> 148 

<211> 1668 

<212> DNA 

<213> E. Coli 

<400> 148 

gtggctcaat tegtttatae catgeategt gtcggcaaag ttgttccgcc gaaaegtcat 



60 
120 
180 
240 
300 
3 60 
420 
480 
540 
600 
660 
720 
780 
840 
891 



60 
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attttgaaaa 

aatggcgcgg 

ggtgaagcgc 

aacccggaac 

ctgaaacgcc 

ctggccgctg 

aacgtacagc 

gctaacctct 

ccagacatgc. 

ctggaacgct 

ttcctcgata 

gaaggtaact 

caagaagcgg 

aaaggccgtc 

gaatatcaga 

gataaagtgc 

gacctgagct 

ggtaaatcga 

actttgggtg 

agcaaaaccg 

gaga tgccaa 

cgcgttggtg 

gtt ggcggca 

cgcgcgctgg 

cgttggttcc 

gagttcttcg 

gacgcgctgg 



acatctctct 
gtaagtccac 
gtccgcagcc 
acaccgtgcg 
tggatgaagt 
aacaaggccg 
tggagcgtgc 
ccggtggtga 
tgctgctcga 
t cctgcacga 
acgttgcagg 
actcctcctg 
cgcgtcgtaa 
agtcgaaagg 
aacgtaacga 
tggaagtcag 
tctcgatccc 
ccctgttccg 
aaacggtgaa 
tttgggaaga 
gccgcgccta 
aactctccgg 
acatgctgct 
aaaacgccct 
t cgaccgtat 
aaggtaactt 
agccgaagcg 



gagtttcttc 
cctgctgcgc 
agacatcaag 
tgagtccatt 
gtatgcgctg 
tctggaagag 
ggcggatgcg 
acgtcgtcgc 
cgaaccgacc 
cttcgaaggc 
ctggatcctc 
gctggagcag 
gtcgattgag 
taaagcacgt 
aaccaacgaa 
caacctgcgt 
gaaaggagcg 
tatgatctct 
actggcgtcg 
agtt tccggc 
cgttggccgc 
tggtgagcgc 
gctcgacgaa 
gctggagtt c 
cgccacgcac 
taccgagtac 
tatcaagtac 



cctggggcaa 
attatggcgg 
attggttatc 
gaagaagcgg 
tacgccgatc 
atcattcagg 
ctacgtctgc 
gtagcgttgt 
aaccacctgg 
accgttgtgg 
gaacttgacc 
aaagatcagc 
aaagagctgg 
ctggcgcgct 
ctgtttattc 
aaatcctatg 
atcgtcggga 
ggtcaggaac 
gttgatcagt 
gggctggata 
tttaacttta 
ggtcgtctgc 
ccaaccaacg 
ccgggctgtg 
attctggatt 
gaagagtaca 
aagcgtattg 



aaattggtgt 
gcattgataa 
tgccgcagga 
tttcagaagt 
cggatgccga 
ctcacgacgg 
cggactggga 
gccgcctgct 
atgccgaatc 
cgattaccca 
gcggtgaagg 
gcctggcgca 
aatgggtacg 
t tgaagaact 
cacctggacc 
gcgatcgtct 
tcatcggt cc 
agccggacag 
tccgtgactc 
tcatgaagat 
aaggggttga 
atctggcgaa 
acctggatat 
cgatggttat 
accaggatga 
agaaacgcac 
cgaagt aa 



cctgggtctg 

agacat cgaa 

accgcagct g 

ggttaacgcc 

t tttgacaag 

tcataatctg 

cgcgaaaat c 

gctggaaaaa 

cgtggcctgg 

cgaccgttac 

tattccgtgg 

ggaagcttca 

tcaaggtact 

gaacagcacc 

gcgtctgggc 

gctgattgat 

gaacggtgcg 

cggcaccatc 

aatggataac 

cggcaacacc 

tcagggtaaa 

gctgctgcag 

cgaaaccctg 

ct cgcacgac 

aggtaaagtt 

gctgggcgca 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1668 



<210> 149 
<211> 522 
<212> DNA 
<213> E . Coli 



<400 
atgtcaaagc 
actgatgatg 
acatgggttg 
gttagtgctg 
tcccttaatc 
ctgaaggtct 
agaaacattg 
gaccgacgca 
ctaaaaaagc 



> 149 
caaaataccc 
gttacaggat 
ccctctatga 
atccagagtt 
aggctgctgc 
a t gaagagcg 
caatatcagt 
ttgaggatet 
tgaaagcctt 



<210> 150 
<211> 852 
<212> DNA 
<213> E. Coli 



ttttgaaaag 
catctcggca 
aaaacatgga 
gcgtattaag 
tcactttatg 
cggagaagct 
tgatccagaa 
tgaaaggcaa 
agctcatccc 



cgccttgaag 

cgttttggtg 

gaaaaaggtt 

gtcgtgaaag 

cttgctggta 

ggtttacgcg 

aaagcggcat 

gttcgatttc. 

acgaaaaagt 



tcgtgaatca 
tcccccgaac 
taattcccaa 
ctgtgatcga 
gtggttctgt 
cgctcaagat 
cagcattgga 
t.tgaaacgcg 
ga 



ctacttcaca 
ccaggt cagg 
acctaaaggc 
gcagcacat g . 
agccaggtgg 
tggcaccaaa 
gctgtcaaaa 
gcttat gtat 



60 
120 
180 
240 
300 
360 
420 
480 
522 



<400 
gtgaaagtac 
gagataccgc 
gcggacgtta 
cgtagggtaa 
cgcctgatgg 
agaggagagg 
ccaaacgaga 
ttgtctccag 
ccagtgatga 
gagcatcctg 



> 150 
tcaacgagct 
gcagtacgtt 
aaaagcgtat 
cgctgtctct 
gaaccctctc 
tagggcaaac 
agtgggttac 
taatagatct 
acatggttga 
ttctgcactc 



aaggcagtt t 
ttattatcat 
tagtgagatt 
tcatcgagaa 
acttaaagca 
cgcccct aat 
cgatgttact 
cttcaacaac 
gaatatgctc 
tgaccaggga 



tatcctcttg 
ctaaaggctc 
tatcacgaga 
gggaaacaga 
gcgattaagg 
gttctccaaa 
gaatttgcag 
gaagttattt 
gatcaggcat 
tggcagtat c 



atgagcttct 
tcagcaagcc 
atagaggccg 
ttaaccataa 
tcaagcgat a 
gagatttcaa 
tcaatgggcg 
cttacagcct 
tcaaaaagct 
gtatgagaag 



cagggctgcg 
tgacaagtat 
atacggatac 
agctgttcag 
ccgctcttac 
ggctacgcgg 
caagctgtat 
ttcggaaaga 
taatcctcac 
atatcaaaat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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atccttaaag aacatggtat taaacaaagc atgtccagaa aaggcaattg tctggataat 660 

gctgtggtgg agtgtttctt tggaacctta aagtcggagt gtttttatct tgatgagttc 720 

agtaatataa gcgaactgaa ggatgctgtt acggaatata ttgaatacta caacagcaga 780 

agaattagcc tgaaattaaa aggtctgact ccaattgaat atcggaatca gacctatatg 840 

cctcgtgttt aa " " q^2 



<210> 151 
<211> 117 
<212> DNA 
<213> E . Coli 



- <400> 151 

atgaaagttc gtgcttccgt caagaaatta tgccgtaact gcaaaatcgt taagcgtgat 60 

ggtgtcatcc gtgtgatttg cagtgccgag ccgaagcata aacagcgcca aggctga 117 

<210> 152 
<211> 1332 
<212> DNA 
<213> E. Coli 

<400> 152 

atggctaaac aaccgggatt agatttt caa agtgccaaag gtggcttagg cgagctgaaa 60 

.cgcagactgc tgtttgt t at cggtgcgctg attgtgttcc gtattggctc ttt'tattccg 120 

atccctggta ttgatgccgc tgtacttgcc aaactgcttg agcaacagcg aggcaccatc 180 

attgagatgt ttaacatgtt ctctggtggt gctctcagcc gtgcttctat ctttgctctg ■ 240 

gggatcatgc cgtatatttc ggcgtcgatc attatccagc tgctgacggt ggttcaccca 300 

acgttggcag aaattaagaa agaaggggag tctggtcgtc gtaagatcag ccagtacacc 360 

cgctacggta ctctggtgct ggcaatattc cagtcgatcg gtattgctac cggtctgccg 420 

aatatgcctg gtatgcaagg cctggtgatt aacccgggct ttgcattcta cttcaccgct 480 

gttgtaagtc tggtcacagg aaccatgttc ctgatgtggt tgggcgaaca gattactgaa 540 

cgaggtatcg gcaacggtat ttcaatcatt atcttcgccg gtattgtcgc gggactcccg '600 

ccagccattg cccatactat cgagcaagcg cgtcaaggcg acctgcactt cctcgtgttg 660 

ctgttggttg cagtattagt atttgcagtg acgttctttg' ttgtatttgt tgagcgtggt 720 

caacgccgca ttgtggtaaa ctacgcgaaa cgtcagcaag gtcgtcgtgt ctatgctgca 780 

cagagcacac atttaccgct gaaagtgaat . atggcggggg taatcccggc aatcttcgct 840 

tccagtatta ttctgtt ccc ggcgaccatc gcgtcatggt tcgggggcgg tactggttgg 900 

aactggctga caacaatttc gctgtatttg cagcctgggc aaccgcttta tgtgttactc 960 

tatgcgtctg caatcatctt cttctgtttc ttctacacgg cgttggtttt caacccgcgt 1020 

gaaacagcag ataacctgaa gaagtccggt gcatttgtac caggaattcg tccgggagag 1080 

caaacggcga agtatatcga taaagtaatg acccgcctga ccctggttgg tgcgctgtat 1140 

attaccttta tctgcctgat cccggagttc atgcgtgatg caat gaaagt accgttctac 1200 

ttcggtggga cctcactgct t atcgttgtt gtcgtgatta tggactttat ggctcaagtg 1260 

caaactctga tgatgtccag tcagtatgag tctgcattga agaaggcgaa cctgaaaggc 1320 

tacggccgat aa 1332 

<210> 153 

<211> 435 * 
<212> DNA 
<213> E. Coli 



<400> 153 
atgcgtttaa atactctgtc tccggccgaa 
cgtggtatcg gttctggcct cggtaaaacc 
cgttctggcg gtggcgtacg tcgcggtttc 
ctgccgaaat tcggcttcac ttctcgtaaa 
gacctggcta aagtagaagg cggtgtagta 
atcggtatcc agatcgagtt cgcgaaagtg 
actgttcgtg gcctgcgtgt tactaaaggc 
aaaatcgagg aataa 

<210> 154 



ggctccaaaa aggcgggtaa acgcctgggt 60 

ggtggtcgtg gtcacaaagg tcagaagtct 120 

gagggtggtc agatgcctct gtaccgtcgt 180 

gcagcgatta'cagccgaaat tcgtctgtct 240 

gacctgaaca cgctgaaagc ggctaacatt 30.0 

atcctggctg gcgaagtaac gactccggta 360 

gctcgtgctg ctatcgaagc tgctggcggt 420 

435 
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<400> 154 

atggcaaaga ctattaaaat tactcaaacc cgcagtgcaa tcggtcgtct gccgaaacac 

aaggcaacgc tgcttggcct gggtctgcgt cgtattggtc aca.ccgtaga gcgcgaggat 

actcctgcta ttcgcggtat gatcaacgcg gtttcctfca tggttaaagt tgaggagtaa 

<210> 155 
<211> 504 
<212> DNA 
<213> E. Coli 



60 
120 
180 



<400 
atggctcaca 
gtatctaaaa 
gatggtaacg 
cagaaagcga 
ctgcaacacc 
gaaggtaccg 
cataacgttc 
attgatggcc 
gtt gaagaaa 



> 155 
t cgaaaaaca 

. ccgt taaagg 
gtcgcgttgg 
tggaaaaagc 
ctgttaaagg 
gtatcatcgc 
t ggctaaagc 
tggaaaatat 
ttctggggaa 



agctggcgaa 
tggt cgtatt 
ttttggttac 
ccgtcgcaat 
tgttcacacg 
cggtggt gca 
ct atggttcc 
gaatt ctcca 
at aa 



ctgcaggaaa 
ttctccttca 
ggtaaagcgc 
atgattaacg 
ggttctcgcg 
atgcgcgccq 
accaacccga 
gaaatggtcg 



agctgat cgc 
cagctctgac 
gtgaagttcc 
t cgcgctgaa 
tattcatgca 
ttctgg.aagt 
tcaacgtggt 
ctgccaagcg 



<210> 156 
<211> 354 
<212> DNA 
<213> E . Coli 



<400 
atggataaga 
ctgggcgcaa 
gcaccgaacg 
caact gaagt 
cgcgctctgg 
ggtcgtgtcc 



> 156 
aatctgctcg 
ctcgcctggt 
gttctgaagt- 
acaccggtaa 
aaaaaggcat 
aggcactggc 



t a tccgt cgt 
ggtacatcgt 
tctggtagct 
caaagacgcg 
caaagatgta 
agat gctgcc 



gcgaccc.gcg 
accccgcgtc 
gcttctactg 
gctgcagctg 
tcctttgacc 
cgtgaagctg 



cacgccgcaa 
acatttacgc 
t agaaaaagc 
tgggtaaagc 
gttccgggtt 
gccttcagtt 



<210> 157 
<211> 534 
<212> DNA 
<213> E. Coli 



<400 
atgtctcgtg 
ggt caggtta 
gttgaagtta 
ggttgggcac 
gaaggcttca 
aatgtgatta 
atcactgctg 
atcggccagg 
ggtgttcgtt 



> 157 
ttgctaaagc 
ttacgatcaa 
aacat gc'aga 
aggctggtac 
ctaagaagct 
acctgtctct 
aatgt ccgac 
ttgcagcgga 
acgccgacga 



accggtcgt t 
aggtaaaaac 
taat accctg 
cgcgcgtgcc 
gcagctggtt 
gggt t t ctct 
tcagactgaa 
tctgcgcgcc 
agtcgtgcgt 



gttcctgccg 
ggcgagctga 
accttcggtc 
ctgctgaact 
ggtgtaggtt 
catcctgttg 
atcgtgctga 
taccgtcgtc 
accaaagagg 



gcgttgacgt 
ctcgtactct 
cgcgtgatgg 
caat ggttat 
accgtgcagc 
accatcagct 
aaggcgctga 
ctgagcctta 
ctaagaagaa 



<210> 158 
. <211> 393 
<212> DNA 
<213> E . Coli 



ggtaaaccgc 
tgtagttggc 
agcagcgatc 
taacggcact 
gccggct t cc 
cgctggggtt 
tcgtgcaact 
tggtaaatcc 



gctccaggag 
acaggtaatt 
tatcgctgaa 
tgtcgctgaa 
ccaatatcat 
ctaa 



aaaaat caac 
caacgatgct 
t tacgcagac 
cggtgt tacc 
ggttaaaggc 
gcctgcgggt 
taagcaggtg 
taaaggcaag 
gtaa 



60 
120 
180 
240 
300 
360 
420 
480 
504 



60 
120 
180 
240 
300 
354 



60 
120 
180 
240 
300 
360 
420 
480 
534 



<400> 158 

atgagcatgc aagatccgat cgcggatatg ctgacccgta tccgtaacgg tcaggccgcg 



60 
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aacaaagctg cgg^^Ecat gccttcctcc aagctgaaag tggcaatcgc caacgtgctg 
aaggaagaag gttttattga agattttaaa gttgaaggcg acaccaagcc tgaactggaa 
cttactctga agtatttcca gggcaaagct gttqtagaaa gcattcagcg tgtcagccgc 
ccaggtctgc gcatctataa acgtaaagaf gagctgccga aagttatggc gggtctgggt 
atcgcagttg tttctacctc taaaggtgtt atgactgatc gtgcagcgcg ccaggctggt 
cttggtggcg aaattatctg ctacgtagcc taa 



120 
180 
240 
300 
360 
393 



<210> 159 
<211> 306 
<212> DNA 
<213> E . Coli 

<400> 159 

atggctaagc aatcaatgaa agcacgcgaa gtaaaacgcg tagctttagc tgataaatac 60 

ttcgcgaaac gcgctgaact gaaagcgatc atctctgatg tgaacgcttc cgacgaagat 120 

cgttggaacg ctgttctcaa gctgcagact ctgccgcgtg attccagccc gtctcgtcag 180 

cgtaaccgct gccgtcaaac aggtcgtccg catggtttcc tgcggaagtt cgggttgagc 240 

cgtattaagg tccgtgaagc cgctatgcgc ggtgaaatcc cgggtctgaa aaaggctagc 300 

tggtaa . 306 

<210> 160 

<211> 540 

<212> DNA 

<213> E . Coli 



<400 
atggcgaaac 
aactacaatt 
ggtgaagcga 
tccggtcaaa 
cagggctatc 
gagcgcctga 
tctttcgacg 
atcgactacg 
aaatctgacg 



> 160 • 
tgcatgatta 
ctgtcatgca 
tcgctgacaa 
aaccgctgat 
cgatcggctg 
tcactattgc 
g teg tggtaa 
ataaagtcga 
aagaaggccg 



ctacaaagac 
agtccctcgg 
aaaactgctg 
caccaaagca 
taaagtaact 
tgtacctcgt 
ctacagcatg 
ccgcgttcgt 
cget ctgctg 



gaagtagtta 
gt cgagaaga 
gataaegcag 
cgcaaatctg 
ctgcgtggcg 
ateegtgact 
ggtgtccgtg 
ggtttggata 
getgectttg 



aaaaact cat 
tcaccctgaa 
cagcagacct 
ttgeaggett 
aacgcatgtg 
t ccgtggcct 
agcagat cat 
ttaccattac 
acttcccgtt 



gactgagttt 
catgggtgtt 
ggcagcaatc 
caaaatccgt 
ggagttcttt 
gtccgctaag 
cttcccagaa 
cactactgcg 
ccgcaagtaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 



<210> 161 
<211> 315 
<212> DNA 
<213> E. Coli 

<400> 161 

atggcagcga aaatcegteg tgatgacgaa gttatcgtgt taaccggtaa agataaaggt 60 

aaacgcggta aagttaagaa tgtcctgtct teeggcaagg tcattgttga aggtatcaac 120 

ctggttaaga aacatcagaa gccggttccg gccctgaacc aaccgggtgg catcgttgaa 180 

aaagaagccg ctattcaggt ttccaacgta gcaatcttca atgeggcaac eggcaagget 240 

gaccgtgtag gctttagatt egaagaeggt aaaaaagtcc gtttcttcaa gtctaacagc 300 

gaaactatca agtaa 315 

<210> 162 
<211> 372 
<212> DNA 
<213> E. Coli 



<400> 162 



atgatccaag 
tgtatcaagg 
atcaccatca 
gtagtggtgc 
ggtaatgc.tt 
gggccggtaa 



aacagactat 
ttctgggtgg 
aagaagcaat 
gcaccaagaa 
gtgttcttct 
ctegtgaget 



get gaacgt c 
ct cgcaccgt 
t ccgcgtggt 
gggtgt tegt 
gaacaacaac 
tegtagtgag 



gccgacaact 
cgctacgcag 
aaggtcaaaa 
cgcccggacg 
agegagcage 
aagttcatga 



ccggtgcacg 
gegtaggega 
aaggtgatgt 
gttctgtcat 
etateggtae 
aaattatctc 



tegegtaatg 
cat catcaag 
getgaaggeg 
tegcttcgat 
gcgtattttt 
tctggcacca 



60 
120 
180 
240 
300 
360 
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<210>' 163 
' <211> 567 • 
<212> DNA 
<213> E. Coli 

<400> 163 

atgtttaaag gacaaaaaac attggccgca ctggccgtat 

gtttatgctg ctgatgaagg ttctggcgaa attcacttta 

ccttgtgaaa . ttcatccaga agatattgat aaaaacatag 

acccatataa accgggagca tcatagcaat aaagtggccg 

tgtgatctgc ctgcttctga caacggtagc ggaatgccgg 

ttcgatagca cggctaagac aactggtgct acgcctttgt 

gaagcaactg gggtcggtgt acgactgatg gacaaaaatg 

tcagccgcgc cagatcttga cctggatgca agctcatcag 

gcctggatgg aacaaattga taatgcagtc gatgtcacgg 

gcaacctacg tgctggatta taaataa 

<210> 164 
<211> 1284 
<212> DNA 
<213> E. Coli 

<400> 164 

atggctgata caaaagcaaa actcaccctc aacggggata 
ctgaaaggca cgctgggtca agatgttatt gatatccgta 
ttcacctttg acccaggctt cacttcaacc gcatcctgcg 
gatggtgatg aaggtatttt gctgcaccgc ggtttcccga 
tctaactacc tggaagtttg ttacatcctg ctgaatggtg. 
tatgacgaat ttaaaactac ggtgacccgt cataccatga 
ctgttccatg ctttccgtcg cgactcgcat ccaatggcag 
gcgctggcgg cgttctafca cgactcgctg gatgttaaca 
gccgcgttcc gcctgctgtc gaaaatgccg accatggccg 
at'tggtcagc catttgttta cccgcgcaac gatctctcct 
atgatgttct ccacgccgtg cgaaccgtat gaagttaatc 
gaccgtiattc tgatcctgca cgctgaccat gaacagaacg 
accgctggct cttcgggtgc ga'acccgttt gcctgtatcg 
tggggacctg cgcacggcgg tgctaacgaa gcggcgctga 
tccgttaaac acattccgga atttgttcgt cgtgcgaaag 
ctgatgggct tcggtcaccg cgtgtacaaa aattacgacc 
gaaacctgcc atgaagtgct gaaagagctg ggcacgaagg 
atggagctgg aaaacatcgc gctgaacgac ccgtacttta 
aacgtcgatt tctactctgg tatcatcctg aaagcgatgg 
accgtcattt tcgcaatggc acgtaccgtt ggctggatcg 
agtgacggta tgaagattgc ccgtccgcgt cagctgtata 
tttaaaagcg atatcaagcg ttaa 

<210> 165 

<211> 1434 

<212> DNA 

<213> E . Coli 



PCT/US00/02200 

-372 



ctctgctgtt 

agggggaggt 

at cttggaca 
tcgacattcg 
tatccaaagt 
t gagcaacac 
acggtaacat 
aacagacgct 
caggtgaagt 



cactgcacct 
tatt gaagca 
agtcacgaca 
cttgatcaac 
tggcgt aacc 
cagtgcaggc 
cgtattaggt 
gaactttttc 
aaccgctaac 



cagctgttga 

ctctcggttc- 

aat ctaaaat 

tcgatcagct 

aaaaaccgac 

t ccacgagca 

tcatgtgtgg 

atcctcgtca 

cgatgtgtta 

acgccggtaa 

cgattctgga 

cctctacctc 

cagcaggtat 

aaatgctgga 

acaaaaatga 

cgcgcgccac 

atgacctgct 

tcgagaagaa 

gtattccgtc 

cccactggag 

caggatatga 



actggatgtg 
aaaaggtgtg 
tacttttatt 
ggcgaccgat 
tcaggaacag 
gattacccgt 
tattaccggc 
ccgtgaaatt 
caagtattcc 
c-ttcct gaat 
acgtgctatg 
caccgt gcgt 
tgctt cactg 
agaaat cage 
ttcttt cege 
cgtaat gcgt 
ggaagtggct 
actgtacccg 
ttccatgttc 
cgaaat gcac 
aaaacgegae 



<400 
atgaaagtaa 
ctggatcgtt 
gttaaagtga 
gcttctctcg 
gcgctgagta 
catccgacca 
tttgaagaag 



> 165 
cgctgccaga 
actggtaegg 
ataccatcga 
gtgctaatgc 
aatctctggc 
ttaccaaatt 
gtttcgaagg 



gtttgaacgt 
ccccaccagt 
agaaegtccg 
acgcctgg.tc 
cgacgtcaac 
aegggtaett 
tgttgatccg 



gcaggagtga 
cgtatctcgc 
ggeggegegg 
gggttgacgg 
gtcaaatgcg 
t cccgcaacc 
cagccgctgc 



tggtggttgg 
cggaagcgcc 
ctaacgtggc 
gcattgacga 
acttegttte 
aacagctgat 
aegageggat 



tgatgt gatg 
ggtgcccgtg 
gatgaatatc 
tgcagcgcgc 
tgtaccgacg 
ccgtctggat 
taatcaggcg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
567 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1284 



60 
120 
180 
240 
300 
360 
420 
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ctgagttcga ttggcfCgct ggtgctttct gactacgcca 
cagcagatga tccaactggc gcgtaaagcg ggtgttccgg 
accgattttg agcgctaccg cggcgctacg ctgttaacgc 
gctgttgtcg gtaaatgtaa gaccgaagaa gagattgttg 
gccgattacg aactctcggc tctgttagtg acccgttccg 
caaccgggta aagcgccgct gcatatgcca acccaagcgc 
ggtgcgggcg acacggtgat tggcgtcctg gcggcaacgc 
gaagaagcct gcttctttgc caatgcggcg gctggcgtgg 
tccacggttt cgccgatcga gctggaaaat gctgtacgtg 
ggcgtgatga ccgaagagga actgaagctg gccgtagcgg 
aaagtggtga tgaccaacgg tgtctttgac atcctgcacg 
gcaaatgccc gcaagctggg tgaccgcttg attgttgccg 
aaacggctga aaggggattc ccgcccggta aacccactcg 
ggcgcactgg aagcggtcga ctgggtagtg tcgtttgaag 
atcgccggga tcttgccaga tctgctggtg aaaggcggcg 
gccgggagta aagaagtctg ggccaacggt ggcgaagtgt 
ggttgctcga cgaccaacat catcaagaag atccaacagg 

<210> 166 

<211> 2841 

<212> DNA 

<213> E . Coli 



PCT/USOO/02200 



aaggtgcgct 
tgctgat tga 
cgaatctctc 
agcgcggcat 
aacagggtat 
aggaagtgta 
tggcaqcggg 
tggtcggcaa 
gacgtgcaga 
cagcgcgt aa 
ccgggcacgt 
tcaacagcga 
aacagcgtat 
aggacacgcc 
actataaacc 
tggtgctcaa 
ataaaaaagg 



ggcaagcgta 
tccaaaaggt 
ggaatt tgaa 
gaaactgatt 
gtcgctgctg 
tgacgttacc 
taattcgctg 
actgggaacc 
tacaggcttt 
acgtggtgaa 
ctcttatctg 
tgcctccacc 
gattgtgctg 
gcagcgcttg 
agaagagatt 
ctttgaagac 
ctaa 



480 
54 0 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1434 



<400> 166 

atgaagccgc tctcttcacc gttacagcag tactggcaga ccgttgttga 
gagcctttag ccgaggaatc acttagcgca caggcgaagt cagtacttac 
tttgtgcagg- acagcgtgat tgcgcatcca gagtggctga cggaactgga 
ccgcaggccg acgaatggca gcattacgcg gcatggttgc aggaggcgct 
agtgacgaag ccgggttaat gcgcgagctg cggctattcc ggcggcgcat 
atcgcctggg cgcaaacgct ggcactggtt actgaagaga gcatattgca 
tatctggcgg agacgctgat tgttgcggcg cgtgactggc tgtat-gacgc 
gagtggggaa cgccgtgcaa tgcgcagggc gaagcgcaac cgctgctgat 
ggtaagctgg gcggtggg<3 : a gctgaatttc tcctctgata tcgatctgat 
ccggaacatg gttgtacgca gggtggacgc cgggaactgg ataacgcgca 
cgcatggggc agcggctgat taaagtgctg gatcaaccaa cgcaggatgg 
cgcgtggata tgcggctgcg tccgtttggc gaaagtggcc cgctggtgct 
gcgttggaag attattacca ggagcagggg cgcgactggg agcgttacgc 
gcgcggatta tgggcgatag cgaaggcgtc tatgctaacg agttgcgtgc 
ccgtttgttt tccgtcgtta catcgatttc agcgtgattc agtcgctgcg 
gggatgattg cccgtgaagt gcgtcgacgt ggtttgaccg acaatatcaa 
ggcggcattc gcgaaattga atttatcgtt caggtgttcc agctcattcg 
gaaccgtcgc cgcaatcgcg ctctttactg ccaacgctca gcgccattgc 
ctgctttctg aaaacgatgc tgaacaattg cgagtggcgt atctgttcct 
gaaaacctgc tgcaaagcat taacgacgaa caaacccaga cgcttccttc 
aatcgtgcgc ggctggcgtg ggcgatggac tttgctgact ggccgcaact 
ctgaccgcac atatgaccaa tgtgcgccgg gtgtttaatg aattgattgg 
agtgaaactc aggaagagtc gctgtcggaa cagtggcgtg agctgtggca 
caggaagatg acactacgcc agtgctggcg catcttagcg aggatgatcg 
ctaacgctga ttgccgattt ccgcaaagag ctggataagc gcaccatcgg 
cgtcaggtgc tcgaccatct gatgccgcat ctgctaagtg atgtctgtgc 
gctgccgtta cgctgtcgcg cattaccgcc ttgctggtgg ggattgttac 
tatttagaat tgctcagtga attccccgcg gcgcttaaac atttgatttc 
gcgtcgccga tgattgccag ccagctggcg cgttatccat tattgctgga 
gatccaaaca ccctttacca gccgacggcg accgatgcct accgcgatga 
tatttgctgc gcgtgccgga agatgacgaa gagcaacagc ttgaggcgct 
aaacaggcgc agctgttacg catcgccgca gcggatatcg ccggtacgct 
aaagtgagcg atcacttaac ctggctggcg gaagccatga tagatgccgt 
gcgtgggttc aaatggttgc ccgctacggt aagccgaatc acctgaacga 
cgtggttttg cggtggtcgg ctacggcaag ctgggcggct gggag.ttagg 
gatcttgacc ttatcttcct ccatgattgc ccaatggatg cgatgactga 
gaaatcgacg ggcggcagtt ttatctgcgt ctggcgcaac gcattatgca 



gcggctgcca 
ttttagtgat 
aagccaaccg 
ctgtaatgtg 
tatggtgcgc 
gcagctcagc 
ctgctgccgc 
tttaggcatg 
ttttgcctgg 
gttttttacc 
cttcgtctat 
gagctttgcc 
gatgg'tcaag 
gatgctgcgc 
caacatgaaa 
act cggcgca 
cggcggacgc 
cgagctgcat 
gcggcgtctg 
tgatgagctt 
gaccggggcg 
cgacgatgaa 
ggatgcgttg 
caaacaggtg 
gccgcgagga 
gcgtgaagac 
ccgcaccacc. 
tctgtgtgcc 
tgaattgctc 
gttgcgccag 
gcgtcagttc 
accggtgatg 
cgttcagcag 
acgcgaaggg 
ctacagt tec 
eggtgagegg 
tctgttcagt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140. 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
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acgcgtacct 
gcgggaatgc 
acgtgggaac 
gcgcact'ttg 
caaacggaag 
gat cgctttg 
tatctggt;gt 
cgtattctgg 
acccgtgctt 
ggccatgtgt 
aagtggctgg 



cttccggcat 
tggtgacatc 
at caggcgct 
acgcagtgcg 
tgcgggaaat 
at atcaaagc 
t gcgctacgc 
a'actactggc 
acactacgct 
cggaggattg 
tggaagaatg 



<210> 167 
<211> 1302 
<212> DNA 
<213> E. Coli 



tttgtatgaa 
cgcagaagca 
ggtgcgt gcg 
tcgcgagatt 
gcgcgagaaa 
t ga tgaaggg 
t ca tgaaaaa 
gcaaaacgac 
t cgcgatgaa 
cttcaccgca 
a 



gtggatgctc 
tttgccgatt 
cgtgtagtgt 
atgacgctgc 
atgcgcgctc 
ggaattaccg 
ccgaagttaa 
attatggaag 
cttcatcatc 
gagcgtgaac 



gactgcgtcc 
at cagaaaaa 
acggcgatcc 
cgcgtgaagg 
atctcggcaa 
atatcgaatt 
cgcgctggtc 
agcaggaagc 
tggcattaca 
tggtgcgggc 



gtccggggcg 
cgaggcctgg 
gcagctcacc 
taaaactctg 
taaacatcgc 
tattacccaa 
agacaacgtg 
gatggcgctg 
ggaattgccg 
aagctggcag 



2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2841 



<400 
atggctcagg 
gaccatctca 
tacgaaacgc 
aacggtcgct 
cgcccggaat 
gaagt ctggc 
agcaccgatt 
gccctcgacc 
ctggaactgc 
caaaccggat 
ggcaatccgg 
gtggaacagg 
gaactgtggg 
gtccgtcata 
cgtgatctgc 
gtctactcta 
gcatggcagc 
tttgccgata 
ttaggcgatc 
ttgctggcgg 
ctgcatcacg 
aacaat cagg 



> 167 
aaatcgaatt . 
atacgctggg 
cggataactg 
atgagat gac 
at aacgtggc 
cgaacggcga 
t ttatcgcga 

a gggggaagt 

tt agcggcga 

tacgccaggg 

cgcgtgaaat 

ggctggaagc 

t acgcggcaa 

cgctgatgct . 

tgactcaatg 

ccgaaacggc 

catttttaga 

tccatctttc 

gctaccgtga 

gttactatga 

ctattgcgac 

aaccgttctg 



aaagtttatt 
cggcgagcac 
gctgcgtggg 
cat gaaagtt 
gttgagcgaa 
att gcccgcc 
aaaatggct g 
gaaagcgggt 
cacgcgcgcg 
cagcctgagc 
caaaccgacc' 
ggcgctcgag 
cga.tgcggcg 
gttcggtggt 
cgaggcgacc 
aatggcgaag 
tgccaaagcg 
ccgccatgcc 
ccagttgcca 
tcctgtcgtc 
cgggcaacgc 
gttgcacagc 



gttaatcaca 
catgaccccg 
cacgatatgg 
gcaggaagag 
ccgacgctcg 
gatct cgcct 
gtggcggtcg 
gaatttgctg 
gtgctgaaac 
aaagcggcgc 
accattttgc 
ctggcgttag 
aaagaacagg 
attgtgccgc 
attgcttctg 
ctggcgttga 
cagggcaaaa 
gctgaactga 
cgcctgacgc 
gcgcaagcct 
at cgaaattg 
ggaaaacgtt 



gtgccgttga 
tgcagttgct 
gcttacgtat 
tgacaggcgg 
acctggcgca 
cccgcgtgca 
atggtagcca 
aacctatctg 
tggcgaacca 
gtggctatca 
atgttgcggc 
cgcaatggca 
tgctggcagc 
gtaaagcgag 
cggtgtctgc 
ccgaatggt t 
tcagcgactc 
aaagcgtttt 
gtgatattga 
ggctggagaa 
aacatttccg 
aa 



ggcgttgcgt 
gaatatttac 
t cgtggcgaa 
ctt aca t cag 
gttaccgacg 
gccgctgttc 
aattgaaatc 
tgagctggaa 
actggtatcg 
tctggcgcag 
aaaagccgat 
gtatcatgaa 
cattagcctg 
cactcactta 
cgtgacggcg 
ggtaagcaaa 
cttcaaacgc 
ctgccagccg 
ctcaatactg 
ctggcagggg 
taatgaggca 



<210> 168 

<211> 213 

<212> DNA 

<213> E. Coli 

<400> 168 

atgtccggta aaatgactgg tatcgtaaaa tggttcaacg ctgacaaagg cttcggcttc 

atcactcctg acgatggctc taaagatgtg ttcgtacact tctctgctat ccagaacgat 

ggttacaaat ctctggacga aggtcagaaa gtgtccttca ccatcgaaag cggcgctaaa 

ggcccggcag ctggtaacgt aaccagcctg taa 

<210> 169 

<211> 1572 

<212> DNA 

<213> E. Coli 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1302 



60 
120 
180 
213 



<400> 169 

atgagggaca ttgtggaccc tgtattctct atcggtatct catcattatg ggatgagctg 60 

cgacatatgc cagcaggcgg cgtctggtgg tttaacgtcg atcgccatga agatgctatc 120 

agtctggcga atcaaacaat tgcatcccag gctgaaaccg cacacgtcgc ggtcattagc 180 



-57- 
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atggacagcg atccg^cgaa aatctttcaa ttagatgatt 

aaattatttt caatgctaaa tcatgaaaaa ggtctatact 

tgttctattg atccccataa ttaccttttt attcttgttt 

aacattcctg ccgagcggct tcgctcatgg ttggataaaa 

aaccattgtt cgcttttggt aattaatccc ggaaataata 

ttgcttgagg aataccgttc actttttggt cttgccagtt 

catttgctgg atattgcctt ctggtgcaac gaaaaagggg 

agcgttcagc aacaaaatgg tatctggaca ttagttcaaa 

•ccacgcagcg acgaaaaacg cattctgagt aatgttgctg 

ctatcggaac actggcaact gttcaacaat aacgaagtcc 

gctcaggcgg cgacggtggt cttttcttta cagcaaaatg 

cgcagcattc ataccctgcg tcgccagcgc ggtagtgcga 

aataccgcta gcctgcgcgc caccgatgaa cgtttgttat 

gttattccgt ggaatgcgcc actctcccgt tgtctgacga 

cagaagttta gtcgctatgt gccggaagat atcactacct 

ctcaaactgc gtggtttcca gaagtgggat gtgttctgta 

aataaccctc tattacctgc ccacggtaaa ggcgttctgg 

ggtatccgcg ttgaacaagc cctgacgctg tgtcgcccta 

accattggcg gtaatcggct ggtgctgttt ctctcattct 

accgcgttga atcatatttt cccattgcct actggcgaca 

tggtttgaag atgatcaaat cagtgccgag ctggtgcaga 

caatggggca tgccgctgcc tttaacgcaa agttctaaac. 

gatggtcgcc actggcgacg aataccagaa cccatgcgac 

cgctcatcat ga 

<210> 170 
<211> 189 ■ 
<212> DNA 
<213> E . Coli" 



ctcaagggcc 
at ttgacccg 
gcgcaaataa 
tgaat aaatg 
acgataaaca 
tgcgtt ttca 
tcagcgcccg 
gcgaagaggc 
tactggaagg 
tgtt caatga 
cgcaaatcga 
tgaaaatcct 
tggcctgcgg 
tgatcgaaag 
tgctgtcaat 
atgccgtcaa 
ttgccctacg 
accgtaccgg 
gtcggattaa 
t tt t ct caaa 
tgcgcttgct 
cggtcatcaa 
tgttagatga 



ggaaaaaata 
tgattt-gcag 
cgcatggcaa 
gagcaggtta 
attttcattg 
gggtgaccaa 
tcagcagctt 
ggagatccaa 
tgcgccgccg 
agcccgtacc 
gccact ggcc 
cgtgcgggaa 
tgcaaa tatg 
cgtgcaaggg 
gacccagccg 
caacat gatg 
tccggt accg 
cgatat.catg 
cgatct ggat 
ccgtat ggtc 
tgccccagaa 
tgccgagcac 
tgctgtggag 



240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
840 
900 
. 960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1572 



<400> 170 

atgaccatca gcgatatcat tgaaattatt gtcgtttgcg cactgatatt tttcccgctg 
ggctatctgg cgcggcactc tttgcgacgc attcgcgaca ccttacgttt gttctttgct 
aaacctcgtt atgttaaacc ggccgggacg ttacgccgca cggaaaaagc cagggcaacc 
aa.aaaatga 

<210> 171 
<211> 1680 
<212> DNA 
<213> E. Coli 



60 
120 
180 
189 



<400> 171 



atgactcaat 
ctttccggct 
aacttccatc 
tacagcctgc 
catgacacct 
ttcagtaccg 
gccatttttg 
tttgtggttg 
ttgtggccag 
accgttgcgg 
ccaacaacgg 
aaatcgacct 
aacatctgtt 
tggtcgcatt 
gcggcgatcc 
ccggcaaata 
ctgatgatgg 
ggcatgcaga 
ggttcgccgg 



ttacgcaaaa 
ggaactt eta 
cgctcctcaa 
ategcttgeg 
ggttgcctgg 
attatttaat 
ttttattagt 
ccatact get 
ccggacaacc 
cgacgggtgg 
cgaaccttaa 
tcccgtcttc 
cgctttcctg 
tcgatattga 
gtttactgcg 
aegactgeta 
gaeataaegg 
gcgaattgat 
tttatgacga 



t accgccat g 
ttttctggtt 
tttggtgttt 
ccactggatt 
cccggaaagc 
cgacct tgtc 
ggcctggtta 
at'ggctgaac 
gacgaccact 
cgcaccggta 
cgcctggctg 
gctgcccgct 
gt eggatata 
gttcaagaac 
cgccagctgc 
tctgtttgat 
ccagttcggc 
ggatcaaaca 
taccgctgtg 



ccttcttccc 
aagttcggcc 
gccgcgtttc 
gccctgccga 
ataatgagee 
acaegcttt a 
ttcctgtcac 
gtacttaccc 
gt aacaacga 
gtgggtgata 
aataatttct 
gatgetcage 
gaagccgccg 
t ttaactccg 
gggcagactt 
aacctttcga 
ggttttttga 
aatctgeegg 
cttaaccgct 



tctggcaata 
tgt t gtgggc 
tgetgatgee 
teggctttge 
agggttcgea 
ttaactggca 
aatggattcg 
tg'gcgggacc 
cgggtggtaa 
tgcccgcaca 
ataacgegga 
catttgaact 
ggttgatgtc 
ccacctecta 
cgcacactaa 
aactgggctt 
aagaagttcg 
ttattttget 
ggctggacgt 



ctggcgcggc 
gggatatctt 
ccttccgcgc 
tttgttctgg 
ggtggcgggg 
gatgattggg 
cattacegtt 
aagtttctcc 
cgcagcggca 
aactgcaccg 
ggcgaaacgt 
actggtgat t 
gcatccactg 
cagtggcccg 
tctgtatcaa 
tacccagcac 
cgaaaatggc 
gggctttgat 
taccgaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
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gataaaaaca 
tatccggggg 
ctggacgcct 
ccggaacacg 
cctagcccgt 
catcaggggg 
gtggttcgcg 
accagtgggt 
taccaggata 



gccgtagt gc 
tcagcaaaac 
tctttactga 
gcggcgcgct 
ctatcaccga 
caccgatt gt 
t tct cgatgg 
tgccacaaac 
aaccgtacgt 



<210> 172 
<211> 384 
<212> DNA 
<213> E. Coli 



cacgttctac 
agcggattac 
acttgagaaa 
gaagggcgac 
cgt ccccgtt 
cat cgaacaa 
caagat tttc 
agcaccggtc 
tcgcctgaac 



aacacgctt c 
aaagcgcggg 
tcgggtcgt a 
agaatgcagg 
ggggtgaaat 
ccgagcagct 
accgaagaca 
tccgagaact 
ggcggcgact 



PCT/US00/02200 



cactgcatga 
cgcagaaatt 
aagtgatggt 
tatctggcct 
tcttcggcat 
tcctggctat 
atgttgactg 
caaatgcagt 
gggtgcctta 



cggcaaccat 
ctttgatgaa 
ggtcgtggtg 
acgtgatatc 
gaaggcaccg 
ctccgatctg 
gaaaaaacr c 
agttattcaa 
cccgcagtaa 



1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 



<400 
atggaaggtt 
gccggtagtt 
agctatgccg 
gaagtccggg 
aagcagaaag 
gatgccgata 
aagctggaag 



> 172 
caagaatgaa 
a tgccactac 
aaaaacacca 
ccaactgt tc 
atgaggtggc 
agattgccaa 
cgcgcgacta 



ataccgcatc 
cctgtgtcag 
aaaccagaat 
aga tagccag 
ggaacgccag 
acgcgaacgg 
ctaa 



gctttagctg tttctctctt 
gaaaaggagc aaaatatcct 
cgtattgacg gtctgaataa 
ctgcgtgccg atcatcagaa 
caagatttag ccgaggcgaa 
aaactggcag aagcgcagga 



tgctcttagt 
taaggagat c 
agccctgagt 
gaaaatcgca 
gcaaaaaggc 
agagctgaaa 



<210> 173 

<211> 306 

<212> DNA 

<213> E. Coli 

<400> 173 

atgtcgaaag aacacactac ggaacatctg cgtgctgagt tga'aatccct ttccgatacg 
ctggaagagg tgcttagct c • at ctggcgag aagtcgaaag aagagttgag taagattcgt 
agcaaagcgg agcaggcact gaaacagagc cgttatcgcc tgggtgaaac cggtgatgcc 
attgccaaac aaacccgtgt cgcggcggcg cgtgccgatg agtatgtgcg cgaaaatccg 
tggacgggcg tgggcattgg cgctgcaatc ggtgtagtgc tcggcgttct gctgtcgcgt 
cgttaa 

<210> 174 
<211> 405 
<212> DNA 
<213> E. Coli 



60 
120 
180 
240 
300 
360 
384 



60 
120 
180 
24 0 
300 
306 



<400> 1 
atggcggaca ct 
attgtttcta tc 
gaagaggaaa aa 
gctgcatttg gt 
cgcctgaatg eg 
atetggaege ta 
gcaaacgatc gg 



74 

catcacgc 
atggttga 
gegaatet 
cttatgag 
atgattgc 
egtaaate 
cagct get 



acaagggccc 
aatggt agag 
ctttcaactt 
cctgatggtg 
caccaccgtg 
gegtaagtet 
cgaggaggag 



ggtaaaagcg 
acacgtctgc 
ttactgatgc 
ctaattattt 
gtgttgctgc 
acgttgctgc 
tcccgtgagc 



t tctgggcat 
ggctggcggt 
tgggectgae 
gggcggttga 
tactggcact 
gccatacacg 
agtaa 



egggcagega 
ggtggagctg 
gatgetttte 
cccgcaatat 
gattggcggt 
ccatgagtta 



60 
120 
180 
240 
300 
360 
405 



<210> 175 

<211> 300 

<212> DNA 

<213> E. Coli 

<400> 175 

gtgagcagta aagtcgaacg tgaacgacgt aaggegcaac tgettageca gatccagcaa 60 

caaeggctgg atctttccgc cagtegtegt gaatggctgg agacaacagg cgcttacgat 120 

cgtcgctgga atatgetget aagtctgege tcctgggcgc tggttggcag tagcgtgatg 180 

gcgatctgga cgattcgcca tcctaatatg ctggtccgct gggccagacg cggttttggc 240 
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tct ggttaaaacg accctcaagc agcaacagct tcgcggttaa 



<210> 176 
<211> 483 
<212> DNA 
<213> E. Coli 



<400 
atgattctct 
agcagcctga 
gtagcgcgca 
tacgcgggta 
gtgattctgc 
acagccctgt 
gctgaaggcg 
ctgctggcaa 
taa 



> 176 
ccatcgacag 
gt ggcgcagt 
ttttaatgcc 
cccaacaata 
t tgagtttgg 
ttactgcggg 
tcaactcgct 
tt accggtcc 



caacgacgct 
ggagagtatg 
gattctgttt 
tatggaagca 
tggtggtctg 
ctttacgctg 
gatgttcatg 
gggcgcgtat 



aat accgcgc 
atgaaaaaat 
att accgctg 
atgggcgtcc 
gcaatcctgt 
ctgacggcat 
aaaaacctga 
agcatcgacc 



cattgcacaa 
tagaagatgt 
get ggggaaa 
egggt tttat 
teggtttect 
ttttatttca 
caatttctgg 
gectgetgaa 



aaaaacaatc 
tggtgt actg 
aattactggc 
get gecactg 
gactcgcacc 
cagcaacttt 
eggatt cctg 
taaaaagtgg 



<210> 177 
<211> 891 
<212> DNA 
<213> E. Coli 



<400 
atgatcaaga 
gcccactacc 
ccagaaataa 
ctggggccac 
catatgeat a 
gegcattteg 
aaaatttatt 
cgggttcttg 
ttaatgttgt 
ggcgatgtag 
aeggtttate 
gaagcatgtc 
tctgaagggc 
cctctggatg 
agaaatcget 



> 177 
agacaaegga 
agtggctggt 
eggataacca 
tcgataacga 
actgtggtcg 
acgcctttca 
tgctgacgat 
atgaatcctt 
t ggatattga 
tat tacgeae 
getaeggggg 
gtgcaggtgt 
atatcaacat 
tggtcattgg 
gcatgt.ttat 



aattgatgee 
gagtatgttt 
ttcttatgga 
tgaattacct 
ggaatt aatg 
ggaggggttg 
ccgtagcaat 
tgatcatcag 
ccgatttaaa 
cctggcaact 
cgaagaattt 
cagaatttgc 
tacegtgaca 
aagageggae 
tgacgaacaa 



atcttgttaa 
cacagcgtgg 
ctgtgccagt 
tacgttcggc 
ctggctattg 
ctttctttta 
atggatgttt 
ttacgcaacg 
ttggttaatg 
t act tageca 
atcattattg 
cagttagtcg 
gcaggtgtga 
egggcaatgt 
aatgtgatta 



atctcaataa 
tegegagaga 
ttggtcggtg 
taatggattc 
tt gaaaatca 
ctgeggcatt 
tgaegggatt 
ctgagcctct 
atacctaegg 
gttggacgcg 
teaaagegge 
ataaccatgc 
gtcgegcatt 
atgagggtaa 
accgagttta 



ggctat cgat 
tgccagtaag 
gattgat cat 
tgecca tcaa 
ctggcaggac 
aaccgattac 
geegggtegt 
gaatct ttat 
gcatttaatc 
tgattacgaa 
taatgatgaa 
catcacacat 
tcctgaagag 
geaaacegga 
a 



<210> 178 
<211> 612 
<212> DNA 
<213> E. Coli 



<400 
atgcgccttc 
tataccccta 
gatatgttgg 
cttatgtata 
gtegcattea 
tctttcgtgg 
ttactaccag 
gatttcctta 
aaattagaac 
ctegt caccg 
tttaatcgtt 



> 178 
gtgttgtgcc 
cagegagage 
atgtatttac 
cgattatcat 
atgett atac 
tcgccatttc 
agcaacgagt 
aaacaggaaa 
ggaagggcat 
aaaaat actc 
ag 



eggttttatt 
ttgtgt taac 
tccattgttg 
ttttggtctc 
tgaaatacct 
cttctcaaga 
attgettegt 
tcttattatc 
cattcaacat 
ccattt tatg 



tcaccacctc 
atttcaatac 
aaactttttg 
actctctggc 
tggctctttc 
ttgegagcac 
ttatctgaga 
acttctcctt 
cagagtgata 
aagttattct 



egggcttegg 
ccttacagtt 
ctaacgagcc 
t gat accgaa 
agattatcgt 
atattcaaaa 
aagaaatege 
gccgtaaccc 
gcgcaaactg 
ggaacagcag 



tggtctegge 
gagagttatt 
actcgaaaga 
agagtt tact 
■ttttgccttt 
gcattattca 
tgtatttaaa 
ggttatgaaa 
ttcttattat 
gagtagacgt 



<210> 179 
<211> 177 
<212> DNA 



300 



60 
120 
180 
240 
300 
360 
420 
480 
483 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
891 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
612 
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<400> 179 

gtgcttctc.c aaccatcggc gcgcaccagt ttcggtttta aatgttttgc ttttggtata 60 

cgtcatggca gtgaacgttc catcctggtt ggggaacacg ccgcacacca gggattcgtt 120 

gttgccgagg tcgatttttt gcattttgcg aatctcacat cttgttgcta cgtatag 177 

<2.10> 180 
<211> 4281 
<212> DMA 
<213> E. Coli 

<400> 180 

atgagcggaa aaccagcggc gcgtcaggga gatatgactc agtatggcgg tcccattgtc 60 

cagggttcgg caggtgtaag aattggcgcg cccaccggcg tggcgtgctc ggtgtgtccg 120 

ggcgggatga cttcgggcaa cccggtaaat ccgctgctgg gggcgaaggt gctgcccggc 180 

gagacggacc ttgcgctgcc cggcccgctg ccgttcattc tctcccgcac ctacagcagc 240 

taccggacga agacgcctgc accggtgggc gttttcggcc ccggctggaa agcgccttct 300 

gatatccgct tacagctacg tgatgacgga ctgatactca acgacaacgg cgggcggagc 360 

attcactttg agccgctgct gccgggggag gcgg'tgtaca gccgcagtga gtcaatgtgg 420 

ctggtgcgcg gtggtaaggc agcacagccg gacggccata cgctggogcg gctgtggggg 480 

gcgctgccgc cggatatccg gttaagcccg catctttacc tggcgaccaa cagcgcacag 540 

gggccgtggt ggatactggg gtggtctgag cgggtgccgg gtgctgagga cgt'actgcca 600 

gcgccgctgc cgccgtaccg ggtgcttacc gggatggcgg accgcttcgg gcggacgctg 660 

acgtaccggc gtgaggccgc cggtgacctg gccggggaaa tcaccggcgt gacggacggt 720 

gccgggcggg agttccgtct ggtgctgacc acgcaggcgc agcgtgcgga agaggcccgc 780 

acctcttcgc tatcttcttc tgacagttcc cgccctctct cagcc'tcagc gttccccgac 840 

acactgcccg gtaccgaata cggccccgac aggggtatcc gcctttcggc ggtgtggctg 900 

atgcacgacc cggcataccc ggagagcctg cccgctgcgc cactggtgcg gtacacgtat 960 

acggaagccg gtgaactgct ggcggtatat gaccgcagca atacgcaggt gcgcgctttc 1020 

acgtatgacg cgcagcaccc gggccggatg gtggcgcacc gttacgcggg aaggccggag 1080 

atgcgctacc gctacgacga tacggggcgg gtggtggagc aactgaaccc ggcagggtta 1140 

agctaccgct atctttatga gcaggaccgc atcaccgtca ccgacagcct gaaccggcgt 1200 

gaggtgctgc atacagaagg cggggccggg ctgaaacggg tggtgaaaaa agaactggcg 1260 

gacggcagcg tcacgcgcag cgggtatgac gcggcaggaa ggctcacggc gcagacggac 1320 

gcggcgggac ggaggacaga gtacggtctg aatgtggtgt ccggcgatat cacggacatc 1380 

accac'accgg acgggcggga gacgaaattt tactataacg acgggaacca gctgacggcg 1440 

gtggtgtccc cggacgggct ggagagccgc cgggaatatg atgaaccggg caggctggta 1500 

tcggagacat cgcgcagcgg ggagacagt-a cgctaccgct acgatgacgc gcacagtgag 1560 

ttaccggcga cgacaacgga tgcgacgggc agcacccggc agatgacctg gagccgctac 1620 

gggcagttgc tggcgttcac cgactgctcg ggctaccaga cccgttatga atacgaccgc 1680 

ttcggccaga tgacggcggt ccaccgcgag gaaggcatca gcctttaceg ccgctatgac 1740 

aaccgtggcc ggttaacctc ggtgaaagac gcacagggcc gtgaaacgcg gtatgaatac 1800 

aacgccgcag gcgacctgac tgccgttatc accccggacg gcaaccggag cgagacacag 1860 

tacgatgcgt ggggaaaggc ggtcagcacc acgcagggcg ggctgacgcg cagtatggag 1920 

tacgatgctg ccggacgtgt catcagcctg accaacgaga acggcagcca cagcgtcttc 1980 

agttacgatg cgctggaccg gctggtacag cagggcggct ttgacgggcg gacgcaacgt 2040 

tatcattatg acctgaccgg aaaactcaca cagagtgagg atgagggact tgtcatcctc 2100 

tggtactacg atgaatcgga ccgtatcact eaccgcacgg tgaacggcga accggcagag 2160 

cagtggcagt atgatggcca cggctggctg acagacatca gccacctgag cgaaggccac 2220 

cgtgttgccg tccactatgg ctatgacgat aaaggccgcc tgaccggcga atgccagacg 2280 

gtggagaacc cggagacggg ggaactgctg tggcagcatg agacgaaaca cgcatacaac 2340 

gagcaggggc tggcaaaccg cgtcacgccg gacagcctgc cgccggtgga gtggctgacg 2400 

tatggcagcg gttacctggc gggaatgaag ctgggcggga cgccgctggt cgagtatacg 2460 

cgggacaggc tgcaccgtga gacggtgcgc agcttcggca gcatggcagg cagtaatgcc 2520 

gcatacgaac tgaccagcac atacaccccc gcaggccagt tacagagcca gcacctgaac 2580 

agcctggtat atgaccgtga ctacgggtgg agtgacaacg gcgacctggt gcgcatcagc 2640 

ggcccgcgac agacgcggga atacggctac agcgccacgg gcaggctgga gagtgtgcgc 2700 

accctcgcac cagacctgga catccgcatc ccgtatgcca cggacccggc gggcaaccgg 2760 

ctgccggacc cggagctgca cccggacagt acactcacag tgtggccgga taaccgcatc 2820 

gcggaggatg cgcactatgt ctaccgccac gatgaatacg gcaggctgac ggagaagacg 2880 
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gaccgcatcc cggcg^gtgt gatacggacg gacgacgagc ggacccacca 
gacagccagc accgcctggt gttctacacg cggatacagc atggcgagcc 
agccgctacc tctacgaccc gctgggacgg cgaatggcaa aacgggtctg 
cgtgacctga cggggtg'gat gtcgctgtcg cgtaaaccgg aggtgacgtg 
gacggagaca ggctgacgac ggtgcagact gacaccacac gtatccagac 
ccgggaagct tcacgccgct catccgggtc gagacagaga acggcgagcg 
cagcggcgca gcctggcaga gacgcrccag caggaaggga gtgagaacgg 
gtgttcccgg ctgaactggt gcggctgctg gacaggctgg aggaagaaat 
cgcgtgagca gtgaaagccg ggcgtggctt gcgcagtgcg ggctgacggt 
gccagacagg tggagccgga atacacaccg gcgcgaaaag .ctcatcttta 
caccggggac tgccgctggc gcttatcagc gaagacggca atacggcgtg 
tatgatgaat ggggcaacca gcttaatgag gagaacccgc atcatgtgta 
cgtctgccag ggcagcagca tgatgaggaa tcagggctgt actataaccg 
tacgatccgt tgcaggggcg gtatattact caggacccga tggggttgaa 
aatttatatc agtatccttt aaatccacta caacaaattg accctatggg 
acttgggatg atgccagatc tggagcatgt acggggggag tttgtggtgt 
ataataggac caagtaaatt tgatagtact gcagatgctg cgttagatgc 
acgcagaata gatctctatg taatgatatg gaatactctg gtattgtctg 
aatggaaaat attttgcatc taaggcagaa actgataatt taagaaagga 
ctgaaaagaa aatgtcccac aggtacagat agagttgctg cttatcatac 
gatagtcatg gcgattatgt tgatgaattt ttttcaagta gcgataaaaa 
agtaaagata ataatcttga agcattttat ctcgcaacac ctgatggacg 
cttaataata aaggagaata tatttttatc agaaatagtg tcccgggatt 
tgcataccgt atcatgatta a * 

<210> 181 
<211> 369 
<212> DNA 
<213> E . Coli 

<400> 181 

atgaaatata gttcaatatt ttcgatgctt tcatttttta tactatttgc 

acagctgttt acggttctga tgaaaacatt atttttatga ggtatgtgga 

ttagataaat actctgttaa aaatacggta aaaactgaaa caatggcgat 

gaaatatatg ttaggtatcg ctatggcgaa cggattgcag aagaagaaaa 

attacggaac taccagatag ttgggttgtt gagggagcaa agttacctta 

ggtggtgtat . ttattataga aattaataag aaaaatggat gtgttttgaa 
agtaaataa 

<210> 182 
<211> 711 
<212> DNA 
<213> E . Coli 



ctaccactac 

actggt cgag 

gcggcgggag 

gtatggctgg 

ggtatacgag 

ggaaaaagcg' 

ccacggcgtg 

ccgggcagac 

ggagcaactg 

tcactgcgac 

"gagcgcggaa 

tcagccgtac 

tcaccggtac 

agggggatgg 

attattgcag 

tctttcacgt 

tttgaaagaa 

taaaga tact 

gtcatatcct 

tcacggtgca 

tcttgtaaga 

att tgaggcg 

gagttcagta 



ctgtaatgag 
aaaattacat 
acaattagct 
accatattta 
tgaagttgcg 
tttcctacat 



2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3 660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
•4281 



60 
120 
180 
240 
300 
360 
369 



<400 
atgctggcgc 
ggcaaccagc 
cagcagtatg 
caggggcggt 
tatccgctga 
ctaataagaa 
tatgaagata 
•ttttgtcgag 
tatgaaaaag 
aagctatccc 
gggttgacat 
gagcataaaa 



> 182 
tgatggatgc 
tgaa t gaaga 
ataaggagtc 
atatcactca 
atccggtgaa 
gaaaagatca 
tgaagagatt 
tgtctaaat t 
agat tagaga 
attctgaaat 
gtccatcaac 
aaacgataaa 



gg'atggaaac 
gaacccgcat 
ggggctgtac 
ggacccgata 
t ggtattgat 
actaaaccat 
aaatttaggt 
aaatgacgct 
ttacgggtta 
gattgaagat 
aacagatt gc 
ggctt tacaa 



attgcgtgga 
cacctgcacc 
tacaaccgga 
gggctggagg 
ccattagggt 
caaagagcat 
gggactgatc 
ggtgttagcc 
aatctgttcg 
aataaaaaag 
tcagataga t 
gatgctggct 



gcggggagta 
agccgt accg 
accggtacta 
ggggatggag 
taagt cccgc 
gggatatatt 
aatttttcca 
gat cggcgaa 
gtatgtacgg 
acttggctgt 
gtagtgatta 
atctcaagta 



tgatgagtgg 

gctgccgggg 

cgatccgttg 

tctgtatgcg 

agatgtagcg 

atctgat act 

ttgtatggca 

agggctgggt . 

cagaaaagta 

aaatgaccat 

tattaatcca 

a 



60 
12b 
180 
240 
300 
360 
420 
480 
540 
600 
660 
711 



<210> 183 
<211> 261 
<212> DNA 
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<400> 183 

atgctggcta tctcaagtaa tctatcaaag atgataatat ttatttttgc tattataatc 

attgttgttt tatgcgtaat tacttatctt tatttataca aagatgaatc tcttgtaagt 

aaacattaca taaactatat ggcaatacca gaaaatgatg gagtttttac atggctccca 

gatttttttc cgcacgtagc ggtggatata ccaacataca caaatataga agatgattat 
ttttttctta tttttcccta a 

<210> 184 
<211> 192 
<212>'DNA 
<213> E. Coli 

<400> 184 

gtgagggcca gggaacaagt ggcgaaaatc gtatcaaaga atgatccaga tacaaaaaaa 
gtgtggtgta aatatggtaa gataccaggg caaggggatg gtgtaaacct tttttttgtt 
ggtgaaatta atgttacgca ttattttata acaaatattg gagctggatt gcctgatgct 
tgtgcagagt aa 

<210> 185 

<211> 504 

<212> DNA 

<213> E. Coli 



<400 
atgccgggca 
aaact caggc 
gcggaaatgg 
tacgcgtatg 
gcgacgctgg 
gatggctggc 
tatacgcagc 
cggaagtcgc 
ctgaacataa 



> 185 
acagcccgca 
cgcagtcggt 
acgaacagt g 
acagtctccg 
ggcgtcttat 
cgctgtatga 
gaattgag'cg 
tgtcgttctc 
aacactatca 



ttatgggcgt 
aacct cgcgc 
gggctatgtc 
gaagacggtt 
gagcctgctg 
.atcccgcctg 
gcataacctg 
aaaatcggtg 
ataa 



tggcctcaac 
at acagccgg 
ggggctaaat 
gttgcgcacg 
tcaccctttg 
aagggaaagc 
aatctgaggc 
gagctgcatg 



acgattttac 
gcagtgacgt 
cgcgccagcg 
tattcggtga 
acgtggtgat 
tgcacgtaat 
agcacctggc 
acaaagtcat 



gtcact taaa 
catcgt ctgc 
ctggct gttt 
acgcactatg 
atggatgacg 
cagcaagcga 
acggct ggga 
cgggcattat 



<210> 186 
• <211> 276 
<212> DNA 
<213> E. Coli 

<400> 186 

gtggcttctg. tttctatcag ctgtccctcc tgttcagcta ctgacggggt ggtgcgtaac 
ggcaaaagca ccgccggac.a tcagcgctat ctctgctctc actgccgtaa aacatggcaa 
ctgcagttca cttacaccgc ttctcaaccc ggtacgcacc- agaaaatcat tgatatggcc 
atgaatggcg ttggatgccg ggcaacagcc cgcattatgg gcgttggcct caacacgatt 
ttacgtcact taaaaaactc aggccgcagt cggtaa 

<210> 187 

<211> 417 

<212> DNA 

<213> E . Coli 



60 
120 
180 
240 
261 



60 
120 
180 
192 



60 
120 
180 
240 
300 
360 
420 
480 
504 



60 
120 
180 
240 
276 



' <400> 187 

atgatgacta aaacccaaat aaataaatta ataaaaatga tgaatgattt 

tttgaagcac cgctcaagga atcatttatt gaaagtataa tccaaataga 

aattcaacta attgcctgga gaagttatgt aatgaagtta gtattctttt 

cctgattatc ttactttttt aagagcaatg gatggattcg aagttaatgg 

tttagcctct cgattccaga accttcagtt aaaaaccttt ttgccgtaaa 

agaaataatg atgatttcat aaaccctgat ctacaagaac ggttagtgat 

agcatttcaa tatttactta tgacattaaa ggtgatgctg ccaacttact 



agactatcca 
atttaattct 
taagaatcaa 
attacgatta 
tgaattttat 
cggggattat 
gatttag 



60 
120 
180 
240 
300 
360 
417 
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<210> 188 
<211> 1179 
<212> DNA 
<213> E . Coli 



<400> 188 

atgagtaata ttgtttacct gacagtaacg ggagaacaac aaggaagcat ctccgcaggt 60 

tgtgggactt ctgagtctac aggtaatcgt tggcagagcg ggcatgagga tgaaatattt 120 

acattctcac tcttaaataa tattaataat acggggcttg gttcacagtt ccatggtata 180 

acattttgta aattaattga taaaagcact ccattattta ttaattccat taacaataat 240 

gaacaattat ttatgggatt tgacttctat cgaataaata gatttggtag attggaaaag 300 

tattattata tacaactaag aggcgctttt ttatcggcta ttcatcacca gatcattgaa 360 

aaccaactgg atacagaaac aataactatt agttatgaat ttatcctctg tcaacatctt 420 

atcgcaaata ccgagttcag ctatttggca ctccctgaaa attataaccg tttgttttta 480 

ccaaattcaa aaaaccaaac aaataatcgt ttcaaaacgt taaacagcaa ■ agctatt ggc 540 

aggctacttg ctgctggtgg cgtatacaat gggaacattg aaggattcag agatactgcg 600 

gaaaaactgg gtggagatgc aataaaaggc tatgatcaaa tactaaatga aaaaacagcg 660 

ggcatagcga tagcaacagc atctattctt ttaacaaagc gttctaatgt tgatacatat 720 

acagaaataa atagttactt aggcaaactt agaggtcaac aaaaacttct tgatggtata 780 

gacataatag aaataatata cattaagaga ccttcaaaag acttagctaa cttacgaaag 840 

gagtttaata aaactgtaag aaaaaatttt cttatcaaac ttgcaaaaac ctccgaagca 900 

tctggaagat tcaacgccga agacctttta agaatgagaa agggcaatgt tcctctaaat 960 

tataatgttc accataaact atctctagat gatggtggta ctaatgattt cgaaaattta 1020 

gtattaatcg aaaacgaacc atatcataaa gtttttacta acatgcaatc acgaatagct 1080 

aagggaatat tagtaggtga aagcaaaatc actccctggg ccattccatc tggctcaatt 1140 

tatcctc'cca tgaaaaatat tatggaccac acaaaatga 1179 



<210> 189 
<211> 666 
<212> DNA 
<213> E . Coli 



<400> 189 

atggtacttg ctttgaacta taatatgcac ggagttaata ttcgctcaga gaatgcagca 60 

aaacctcata cgatgccctc tagatatctt tgcgagtata ttagaagcat tgagaaaaat 120 

ggccacgccc ttgattttgg ctgcggaaaa cttagatatt ctgatgaatt aatcagtaaa 180 

tttgatgaag ttacttttct agactcgaaa aggcaacttg aaagagagca aattattaga 240 

ggaattaaaa ctaaaattat tgactatgtc ccacgatatt ataaaaatgc aaatacagtt 300 

gctttcgagg atgtcgacaa aataattggg ggttacgatt tcatcctttg ctctaatgtt 360 

ctctctgccg ttccttgtcg ggatacaatc gacaaaatag ttcttagcat. caagagatta 420 

ctaaaatcag gaggtgagac tcttattgta aatcaatata aaagctcata cttcaaaaaa 480 

tacgaaacag gaagaaaaca tctttacgga tacatttaca aaaattcaaa aagtgtttct 540 

tactatggat tactcgatga actcgcagtg caagaaatat gttcttcaca tggccttgaa 600 

atattaaagt cgtggagtaa agcaggaagt tcatatgtca ctgttgggag ttgtaatgca 660 

atataa • 666 



<210> 190 
<211> 705 
<212> DNA 
<213> E . Coli 



<400> 190 

gtgaataata tgttcgaacc ccccaaaaat tataatgaaa' tgttgcctaa acttcataaa 60 

gcaactttct taaatacgct aatatattgc atacttctag ttatttacga atacatccct 120 

ttaataacat taccaaccaa gtatgtccca cctattaaag a.tcatgagag ctttattaat 180 

tgggcactat cttttggtat attaccttgt gcttttgcga tttttgcata tttaattagc 240 

ggtgcgttag acctacataa caatgcagcc aaactacttc gggtgcgata tctttgggat 300 

aagcatctaa ttataaaacc gttatcacgg agagctggag tcaacagaaa attaaataaa 360 

gatgaagctc acaatgtaat gagcaatcta tattaccctg aagtaagaaa aattgaagac 420 

aaacattata ttgaactctt ctggaataaa gtatactatt tttggatatt ttt tgaattt 480 
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tcgataattg cattaatttc cttcctaata atcttttttt gcaaacaaat ggatattttt 540 

catgttgaag gttctttgct gtctttattc ttttttgtaa ttttatcatt ctcagtgagt 600 

ggtattatct ttgctttgac agttaagccc agaactgaaa gtcaagtcgg aaaaatcccg 660 

gacgataaaa taaaagaatt tttcactaaa aataacatta attga 705 

<210> 191 
<211> 285 
<212> DNA 
<213> E . Coli 

<400> 191 

atgtttacta tcaacgcaga agtacgtaaa gagcagggta agggtgcgag ccgccgcctg 60 

cgtgccgcta acaagttccc ggcaatcatc tacggtggca aagaagcgcc gctggctatc 120 

gagctggatc acgacaaagt catgaacatg caagctaaag ctgaattcta cagcgaagtt 180 

ctgaccat-cg ttgttgacgg taaagaaatc aaagttaaag ctcaggacgt acagcgtcac 240 

ccgtacaaac cgaagctgca gcacatcgac ttcgttcgcg cttaa 285 

<210> 192 
<211> 1977 
<212> DNA 
<213> E . Coli 



<400 
atggtattgt 
atgaatctga 
cgtgaagagg 
actgagcaat 
gaagggttgg 
gccacactca 
gtggtgctgc 
gcgctgacca 
cctgatgagt 
tccaacgcgc 
atggctgcgg 
aaacaaggcg 
gcctgtattt 
gcgctttcag 
cttaccct ca 
agtgtcaaaa 
attgtcgcgg 
caagatctgt 
ctgctcggaa 
aaaccggcgt 
. tttctcggcg 
cact tgcgt c 
ggtacgttgg 
aataactcgc 
gccattct eg 
gcattctgcc 
gtcaaaatgg 
aaagagtttg 
gaaggggega 
ggctctatgg 
geeggcattt 
attggctggt 
tggcggcgtc 



> 192 
tttateggge 
cgactctgac 
ccat ccacgc 
ttctggaaga 
ctgtgccgca 
gcgagccgct 
tggegattec 
cgcgccttgc 
tgctctcggc 
caactatcgt 
aatatctgga 
etaaeggcat 
ttgcggctga 
tgcctgttgc 
agegt agega 
eggagctgaa 
ggggcacggt 
ttaat gaaga 
ttttgatggt 
tagegecagg 
cggtcgttgg 
ttagcagtaa 
gagegggcag 
ttaccgcct g 
gttttatgtg 
tgggcgcaat 
ttteggcatt 
aaat tgagac 
tcccgatggc 
taaegggege 
tctcgctctt 
ttggcgcggc 
acgcggttaa 



acactggcgc 
ccaccgcgat 
gttgactcaa 
agtgtatcgc 
tggcaaaact 
tcagtgggaa 
ccccaatgaa 
ggatgatgag 
gctggatgac 
ctgcgtaacg 
aaaagccgga 
tgaagggcgt 
agtcgccatc 
egagecgatt 
tgagacgcgt 
acaggcactg 
gctggcggtc 
aaactcctgg 
accggtgctc 
etttgegget 
eggattgata 
attcaatgga 
t ctgatgctg 
getgaaeggt 
ttcctttgac 
ggegaaegge 
t accgtaacc 
egggaaatec 
gat tgaagat 
t attgtcggt 
tttacttcat 
attggtgggg 
gcatggcaac 



gactataaaa 
gcgttgtgt c 
cgtcttgctg 
cgtgaaagcc 
getgeggtaa 
ggcgttgatg 
gegggtacta 
at tegggege 
aagggaggca 
gcctgtccgg 
cgcaaact eg 
ttaaeggegg 
aaggagagtg 
cgccatgcag 
accgtacagc 
ttgagcggaa 
geggtattae 
ctgtggatgt 
geggectata 
ggacttgccg 
geeggttact 
ttcctgactt 
tttgtggtgg 
ctgtcaggaa 
cttggagggc 
gtttaeggee 
gcttccacga 
acctggctgt 
ccgctgcggg 
gcgatgaat a 
gataatggcg 
getgeaatet 
tatctgactg 



acgatcaagt 
tgaatgcgcg 
ctctggggaa 
ttggcccgac 
aagaagegge 
gcccggaagc 
egcatatgea 
gtatacagtc 
cgcaacctt c 
egggtattge 
gcgtaaatgt 
atcaactcaa 
agcgttttaa 
a'agcgttgat 
aagatacgea 
tetcttttge 
tgtcgcaaat 
accgcaagct 
ccgcctattc 
ccaacatgat 
tgatgcgctg 
tttatctcta 
gggaacctgt 
gtaacgeget 
cagtgaataa 
egtatgecat 
tgctcgcacc 
tagggctggc 
ttattggttc 
tcggcctttc 
cgggcggtgt 
egactgeaat 
atggcgtaat 



gaggat catg 
ctttaccagc 
aatttccagt 
egcett aggt 
gtttgcggtc 
agttgattta 
actgetgaca 
ggcgacgacg 
tgcctctttt 
tcacacctat 
ttacgttgaa 
tagtgeaace 
tggcat tccc 
ccaacaagcg 
aceggt gaaa 
cgtaccgttg 
ettegggcta 
gggcggcggg 
tctggcagat 
cggctccggg 
ggtgaaaaat 
cccggtgctc 
cgcctggatc 
gttgctgggt 
ageegcttat 
tttcgcctcc 
gcgcctgttt 
aggtattacc 
gtttgtgctg 
gacacccggt 
tatggeggea 
tctcctgatg 
gecataa 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1977 



<210> 193 
<211> 2634 
<212> DNA 
<213> E . Coli 
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<400 
atgaaagcag 
ttcaccaccg 
ctggaacagg 
gattatttcg 
aagttgat.ta 
atcgtccgta 
ataggttatt 
tttggcatta 
accgagtttt 
ggctacgcca 
agttattttg 
catgaccaga 
taccctcaac 
cagcgagata 
catcgcacca 
aagattgtta 
caccacggct 
atcggctgct 
gaagacatgg 
cagagcgacg 
atcaacacca 
cctgtaccgt 
atagttcatt 
gtcccttcta 
gcggcaaaaa 
. gaggatggtt 
caaattgaag 
tgggtaatta 
agcagggctg 
cggcaatcca 
cgtattgatg 
gtccctacac 
cgccccgt ga 
gttccggtat 
gtctttagcg 
attacgttgc 
cggccttcgg 
cgcctaagtt 
cgagcatggc 
aacaaagccg 
tgcctgat aa 
aatccggctg 
tgct cagaaa 
tcggggcctt 



> 193 
tatct cgcgt 
aagagtcacg 
acaacgaata 
cggt gaaacc 
tcggcccctg 
atctgatgta 
taccagattc 
cccgcaccat 
tgtggcaaag 
tcggtaagta 
acgtgctgga 
tgccattgca 
gtaagtttgt 
atctggcaac 
tcggttctac 
at ctgctgga 
tgctggagaa 
gctgtagtga 
cggataatct 
ccgacaaact 
ctgtgcggct 
attttattcg 
acggtaatta 
tgggctatcg 
gt gacgct ga 
ctctgcaact 
aaagctctga 
ccgcagcgaa 
ttatccgcta 
ctggcagagt 
tggatatcaa 
cttttaacac 
acgacagtgc 
ggaatatgct 
aagggttacg 
tgcgtggcgt 
gaattaaaat 
tattgagtta 
tgactccagt 
gattcaacgt 
gcgcacttaa 
aatcagcaac 
cgatgatgga 
ttttacccgg 



t cacatcacc 

tattctgctg 

caaa tattac 

ggaaaacaaa 

gtatacccag 

cggaatgcgt 

ctttggcatg 

gttctggcgc 

cagtgacggt 

cttacctgcc 

aaaagcgtct 

gcaaaatatc 

gatgagccgc 

cctgaaaggg 

gcgtatggat 

accgctggca 

aatgtggaaa 

caaagttcat 

gattcgtttc 

cgtcctgttt 

gcgcgccagc 

ccatgcccgt 

cgatcccttt 

cacgctttat 

agggatactg 

ggt agataaa 

tgatggtgat 

cgcgaaaccg 

tgacatggca 

aggggtggtg 

tcttgataac 

cgacagtgtt 

aatgaacaac 

caactatgtt 

tgaatttgaa 

gggcttactg 

gccagtcccg 

taccggtacg 

acagtgttac 

gccggaaagt 

gaaagctgaa 

ctgtgatgcg 

tgaacacatt 

ccagtcacgg 



ccgcat a tgc 

gtcaataata 

gtactcgacg 

gaccgtgtga 

accgat acca 

gactgcctcg 

tccgggcaac 

ggatgttcgg 

agcgaagtga 

gacgaaaacg 

gtaaccaaag 

ttcgaagtga 

t ttgaagagg 

gaatttattg 

atcaaaattg 

acactggcct 

gagatcttaa 

cgcgaaatcg 

tacatgcgca 

aacctgatgc 

cagtttaatt 

gagatcgatc 

atggagtttg 

at cgaagcga 

gaaaatgctt 

gacagcggtg 

gaatatgact 

caatgcgata 

gtgccgctca 

ttggttgtca 

caggctgacg 

ctggcagata 

tggcagcaag 

gccttacagg 

gtcatcggtg 

ggcaaagaag 

gactcacaac 

ccaaccgccg 

aacaaaatcc 

tatagtttgt 

gaccgacaag 

actgttgctt 

accaccgagg 

acgttcagtt 



actgggatcg 

tggaagagat 

ggcaaacggc 

agaaacaggt 

cgattgtttc 

cgtttggcga 

tgccgcatat 

agcgccacgg 

cggcgcaggt 

gattacgtaa 

agatt ttgct 

tggataagct 

tattt gagaa 

atggcaaata 

cccacgcgcg 

ggacgttggg 

aaaat catgc 

tcgcccgctt 

aaattgccga 

cctggccgcg 

tgcgggacga 

caggcctaat 

atatacagat 

atcagcctgg 

tctggcaaat 

tgcgctatga 

attcacccgc 

ttattcatga 

a tttgt caga • 

ctcttagtca 

atcatcgcct 

cgcagtttgg 

aaggct ggaa 

aagggcgtaa 

aagagaagaa 

atctgctttt 

tacgtggtct 

ctggtgtagc 

catgggatgt 

tgaaaatgcc 

aagt gatttt 

tcagtcgcga 

aaaat caagg 

accggcttgc 



agagtggtat 

cctgtgccga 

gatcctcgaa 

agaagccggc 

tgcggaatcc 

gccgatgaaa 

ctacaa tgga 

tactgataaa 

gctgccgctg 

acgcct cgac 

gccgaa t ggg 

acgtgagatc 

gatcgaagcg 

tatgcgcgtg 

tatt gaaaat 

ttttgaatac 

ccacgacagt 

cgaact ggct 

caacat gccg 

tgaagaagtt 

tcgcggtcag 

cgatcggcaa 

caaccagatt 

caacgt aatt 

tgcgctcaat 

ccgggt attg 

aaaagaagag 

agcctggcag 

acgcagcgcc 

taacagcagg 

tcgtgt cctg 

ttcgct aacg 

agaagcgccg 

cggcatggct 

aacctttgcc 

aaggcctggg 

gctttcttgt 

tcagcaggcg 

gatgaagctc 

cccagt ggga 

acggctgttt 

ggtgatttct 

ttcaaatcta 

ctga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2634 



<210> 194 
<211> 1572 
<212> DNA 
<213> E . Coli 



<400> 194 

atgatgttag atatagtcga actgtcgcgc ttacagtttg ccttgaccgc 

ttcctttttg tgccactgac gctcggtatg gcgttcctgc tggccattat 

tacgtcctct ccggcaaaca gatttataaa gatatgacca agttctgggg 

ggtatcaact tcgctctggg tgtggctacc ggtctgacca tggagttcca 

aactggtctt actattccca ctatgtaggg gatatcttcg gtgcgccgct 

ggtctgatgg ccttcttcct cgaatccacc tttgtaggtc tgttcttctt 

cgtctgggta aagttcagca tatgtgtgtc acctggctgg tggcgctcgg 

tccgcactgt ggattctggt tgcgaacggc tggatgcaaa acccaatcgc 



gatgtaccac 
ggaaacggtc 
caagt tgttt 
gt tcgggact 
ggcaatcgaa 
cggttgggat 
tt caaacctg 
gtccgatttc 



60 
120 
180 
240 
300 
360 
420 
480 
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aacttt gaaa 
gctcaggtga 
ctcggtatca 
tttgctatcg 
gaat ccggct 
tgggaaacgc 
gagacgaaca 
gtggataccc 
aacgggatga 
gttcgtgacc 
tatacgccaa 
atcccgcgtg 
cttctggcaa 
aaatggcttc 
ggctggttcg 
gctgtggcga. 
ggcctgtata 
ggcccaagca 
ccggcacgct 



ctatgcgtat 
aattcgttca 
gcgcatggta 
ctgccagctt 
acgaaatggg 
aacctgcgcc 
aatttgcgat 
cggttatcgg 
aggcgtactc 
agttcaatag 
acgtggctga 
tagcgccgct 
t catcgcgct 
tgcgcgccgc 
tggctgaata 
actcgtcact 
ccctgttcct 
gcctgaaaac 
aa 



ggagatggtg 

cactgt agcg 

tatgctgaaa 

cggtatggct 

cgacgtgcag 

tgctg.cctt t . 

tcaqacccct 

cctgaaagag 

tctgct cgaa 

cat gaagaaa 

tgcgactgaa 

gtactttgcg 

ctctttctgg 

gctgtacggt 

tggccgccaa 

gaccgcaggc 

ggtggcagaa 

cggtcgctat 



agcttctccg 
tctggttatg 
ggtcgtgact 
gctgttctgt 
aaaaccaaac 
actctgttcg 
tacgcactgg 
ctgatggtgc 
caactgcgtt 
gacctcggtt 
gcgcagattc 
ttccgtatca 
agtgtcatcc 
attccgctgc 
ccgtgggcta 
gatctcatct 
ttgttcttaa 
cactttgagc 



agctggtgct 
tgactggcgc 
tcgccttcgc 
ctgttattgt 
tggctgctat 
gcattcctga 
gcatcattgc 
agcatgaaga 
ctggttctac 
acggtctgct 
aacaggcaac 
tggtggcgtg 
gcaaccgc'at 
cgtggattgc 
tcggtgaagt 
tctcaatggt 
tgttcaagt t 
agtcttccac 



taacccggtt 
gatgttcatc 
taaacgctcc 
tctgggtgat 
tgaagccgag 
tcaggaagag 
aacgcgttcc 
acgcattcgt 
cgaccaggcg 
gctgaaacgc 
caaagactcc 
tggcttcctg 
tggcgagaaa 
tgtagaagcg 
gctgccgaca 
gctgatttgc 
tgcacgcctc 
gactactcag 



540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1572 



<210> 195 
<211> 1140 
<212> DNA 
<213> E. Coli 



<400 
atgatcgatt 
ggttttgcag 
cgt aacgaca 
caggtttggc 
gctgcgttct 
ccggtcggtt 

tggggcatct 

ctgttgcagg 
ttcttccagt 
attactcagg 
cgtgcaacgg 
tgggtgatgt 
t ct aacccac 
aacacgccaa 
ctgactgcac 
tgcatcat cc 
atgaacgcaa 
acctgggttg 
aaaatgttcg 



> 195 
atgaagtatt 
tcactgacgg 
ccgagcgtcg 
tgatcaccgc 
ccggcttcta 
ttgactaccg 
t cattggtag 
gcgtaccgtt 
tgcttaaccc 
gcgcaaccta 
ctcaggtggc 
acggtatcga 
t gaataaaga 
ttctgtgggc 
gtatggataa 
tgacagccgg 
gtctgacaat 
cggtggttct 
gtcgtatcac 



gcgtttt at c 
ttt cgacatg 
aattatgatt 
gggcggcgca 
tgtggcgatg 
ctccaagatt 
cttcgttccg 
caacgttgat 
gttcggcctg 
tctgcaaatg 
tgcgctggtg 
tggttatgtc 
agtggttcgt 
t attccggca 
agccgcgtgg 
tatcgcaatg 
gtgggatgca 
ggtaccgatc 
caaagaagat 



tggtggctgc 

ggggtgggca 

aactccattg 
ctctttgctg 
atcctcgt gc 
gaagaaaccc 
ccgctggtaa 
gaatatctgc 
ctggcaggcg 
cgtaccgtgg 
acactggtct 
gtgaaatcga 
gaagctggcg 
ctgggtgtgg 
gcgtttgtgt 
ttcccgtttg 
acttccagcc 
attctgctct 
attgaacgta 



tggttggcgt 
tgctcacccg 
caccacactg 
cctggccgat 
tggcgtcttt 
gctggcgtaa 
ttggtgtagc 
gtctgtacta 
tggtgagcgt 
gcgaactgca 
gtttcgcact 
caatggacca 
catggctggt 
ttctgccgct 
tctcctccct 
tgatgccgt c 
agctgacgct 
acaccgcctg 
acacccactc 



tctgctgatt 
tttcctcggt 
ggacggtaac 
ggtctatgcc 
gttcttccgt 
catgtgggac 
gttcggtaac 
caccggtaac 
agggatgatc 
cctgcgtacc 
ggctggcgta 
ttacgcagcc 
t aacttcaac 
gctgaccatc 
gacgctggcc 
cagcaccatg 
taacgt catg 
gtgttactgg 
t ctgtactaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



<210> 196 
<211> 1371 
<212> DNA 
<213> E . Coli 



<400 
atggaattat 
agcgcgctgc 
gtacgt tggc 
gccgacgcaa 
cgcatcaaaa 
aaagaaaaag 
tgtacttcgg 
gaagtgat cc 
tatcgcgata 



> 196 
cctcactgac 
gcgggatttt 
tgcaaaaact 
tcggttacct 
ctatcgagcg 
tggcggagat 
aagatatcaa 
tgccatact g 
tcccgctgct 



cgccgtt tec 
cagcga.atat 
ggccgcgcac 
tgatgeaate 
taccactaac 
cccggaactg 
taacctctcc 
gcgtcaactg 
gtctcgtacc 



cctgtcgatg 
ggtttgctga 
geagegatea 
gtcgccagtt 
cacgaegtta 
cacgcggttt 
caegcattaa 
attgatggca 
cacggtcagc 



gaegctaegg 
aattccgtgt 
aggaagttcc 
teagegaaga 
aagcggttga 
ctgaattcat 
tgctgaaaac 
ttaaagatct 
cagccacgcc 



cgataa 
acaagt 
tgcttt 
agatgc 
gtattt 
ccactt 
cgcgcg 
cgccgt 
gtcaac 



agtc 
tgaa 
tget 
ggcg 
cctg 
tgee 
tgat 
tcag 
catc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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ggtaaagaga tgg.caTTacgt cgcctaccgt atggagcgcc 
gtggagatcc tcggcaaaat caacggcgcg gtcggtaact 
tacccggaag ttgactggca tcagttcagc gaagagttcg 
tggaacccgt acaccaccca gatcgaaccg cacgactaca 
gttgcgcgct tcaacactat tctgatcgac tttgaccgtg 
cttaaccact tcaaacagaa aaccattgct ggtgagattg 
aaagttaacc cgatcgactt cgaaaactcc gaagggaatc 
ttgcagcatc tggcaagcaa actgccggtt tcccgctggc 
accgtgctgc gtaacctcgg cgtgggtatc ggttatgcct 
ctgaaaggcg tgagcaaact ggaagtgaac cgtgaccatc 
aactgggaag tgctggctga accaatccag acagttatgc 
ccgtacgaga agctgaaaga gctgactcgc ggtaagcgcg 
cagtttatcg atggtctggc gttgccagaa gaagagaaag 
ccggctaact atattggtcg agctatcacg atggttgatg 

<210> 197 
<211> 186 
<212> DNA 
<213> E. Coli 



agtaccgcca 
ataacgccca 
tcacctcgct 
ttgccgaact 
acgtctgggg 
gttct tccac 
tgggcctttc 
agcgtgacct 
tgattgcata 
t gctggatga 
gtcgctatgg 
ttgacgccga 
cccgcctgaa 
agctgaaata 



gcttaaccag 
catcgccgct 
gggtattcag 
gtttgattgc 
ttatatcgcc 
catgccgcat 
caacgcggt a 
gaccgactct 
tcaatccacc 
actggatcac 
cat cgaaaaa 
aggcatgaag 
agcgatgacg 



600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1371 



<400> 197 

atgctgattc tgactcgtcg agttggtgag accctcatga ttggggatga ggtcaccgtg 
acagttttag gggtaaaggg caaccaggta cgtattggcg taaatgcccc gaaggaagtt 
tctgttcacc gtgaagagat ctaccagcgt atccaggctg aaaaatccca gcagtccagt 
tact a a 



60 
120 
180 
186 



<210> 198 
<211> 93 
<212> DNA 
<213> E. Coli 

<400> 198 , 

ggtgaggtgg ccgagaggct gaaggcgctc ccctgctaag ggagtatgcg gtcaaaagct 60 
gcatccgggg ttcgaatccc cgcctcaccg cca 93 

<210> 199 
<211> 603 
<212> DNA 
<213> E. Coli 



<400 
atgaagaata 
ctccttaaag 
tgttttattc 
cttaaggcaa 
ctattgaata 
catgctgaga 
ttttaccata 
cacatgttac 
atccaggatt 
gcagggcgtt 
tag 



> 199 
aggctgataa 
cagcaaatac 
atggtttccg 
agtgtatata 
aagaagttca 
gcgagtgggt 
ttatctcgac 
gccattcgtg 
atcttgggca 
tttacggcat 



caaaaaaagg 
cgggcctcat 
ggcgagtgaa 
tatccatcga 
ggctttaaaa. 
atttttatca 
tt ccggtggt 
tggttttgct 
tcgcaatatt 
ctgggat aga 



aacttcctga 
gcagcacgta 
attt gt cgat 
tt aaaaaaag 
aactggttga 
cgtaagggga 
aatgccgggt 
ttggcgaata 
cgtcatactg 
gccagaggac 



cccatagtga 
attattgtct 
tgaggatttc 
gcttttcaac 
gtatccgtac 
atccgctttc 
tgtcactgga 
tgggaataga 
tctggt atac 
gacagcgtca 



aatcgaatca 
gactttgctt 
ggatatt gat 
aacgcacccg 
ttcgtacccg 
tcggcaacag 
gattcatccg 
tacgcgactt 
cgccagcaat 
cgctgtt tta 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
603 



<210> 200 
<211> 597 
<212> DNA 
<213> E. Coli 



- <400> 200 

gtgagtaaac gtcgttatct taccggtaaa gaagttcagg ccatgatgca ggcggtttgt 60 
tacggggcaa cgggagccag agattattgt cttattctgt tggcatatcg gcatgggatg 120 
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aatattcgcc 
gaagccgtgg 
gctatattta 
cgcgatgccg 
gcttgcggtt 
gggcatcgaa 
ggattatggg 



PCT/USOO/02200 



aactgcttga 
gactgaagaa 
aacgct ggac 
tttctcgccg 
gtattgaagc 
atgaattggc 
atatt cgcca 
aaagaaataa 



<210> 201 
<211> 549 
<212> DNA 
<213> E. Coli 



<400 
atgaaaatta 
gctctggccg 
aacgccgctt 
cgtaccgcat 
ctgaatgat t 
attgatgcgg 
aacgttggtg 
tttagttcag 
tttgcaaccg 
tatcaataa 



> 201 
aaactctggc 
ctgccacgac 
gcgcagt tga 
cgctggcaca 
gcgataccaa 
gtcataccaa 
tgcagatcct 
aaacaaccct 
gggccgcaac 



<210> 202 
<211> 648 
<212> DNA 
<213> E . Coli 



<400 
gtgctgctaa 
gacgtcatta 
agattattt c 
accacgttgc 
attgaagccg 
catgcggttg 
acggtggtga 
gatgtgct tt 
gatgatgaag 
tcaggctcta 
ggcggcatcg 



> 202 
tgcggatgcg 
cgggcaggga 
tattagcgtc 
ccggcggaaa 
gtgataaaca 
gggaagatag 
gtgaacgtgt 
ccgtgggaga 
gaaacctcgt 
cttcgctaca 
ccaatgccca 



<210> 203 
<211> 726 
<212> DNA 
<213> E. Coli 



t ctgcattat 
cggattttct 
ccaggaacgt 
cgggagtcgg 
tggaaccgta 
ggagcgtggt 
tactgtgcgt 
tctcataaac 



aatcgttgtt 
ggttaatggt 
tgcaggctct 
ggaaggagca 
tgttgcatct 
cgttctggct 
ggacagaacg 
gaataacgga 
cccgggtgct 



acctt caagg 
tgcccaccct 
gttgctgcca 
tatgcaattt 
aatgacggtc 
cgcaccggtg 
aggtgtggcg 
ggggccaggg 
accgattaat 
tttcatcgcc 
ggcctggttc 



caggaccttg 
accgttcacc 
gctaactgga 
ctttctcgcc 
acgcagactc 
gcagatactc 
tataccgcca 
gaaaaa ttaa 



ctgtcggctc 
gggaccgttc 
gttgatcaaa 
accagttctg 
aaagccgctg 
ctgcagagtt 
ggtgctgcgc 
accaatacca 
gctaatgcgg 



ttcagtatca 
tgtgcgataa 
atgtttgctc 
cagggcgtca 
aatatggggc 
ccttttgtta 
tttcacggtg 
atagccacca 
cgtcctccag 
aaatatcgtg 
tctttaacct 



accttaat ga 
cgttacgttt 
aaggcgctga 
agcaggccta 
atcctcatat 
gtttaattca 
gtaatgctgc 
aaagagaaga 



tgtccctcag 
actt taaagg 
ccgttcagt t 
ctgtcggttt 
ttgccttttt 
cagctgcggg 
tgacgctgga 
tt ccgt tcca 
atgcgacctt 



ataacctacc 
aaataacga t 
tggccggaaa 
tt attgcgga 
aaat cagcag 
tt catttacg 
t cgcggatgg 
at a ttggcgt 
caaact ggaa 
ctaccgggcg 
atcagtaa 



aggtagaata 
tgatgagcgt 
ccggactgac 
tcgcattatt 
gttaaggcat 
ggattatctc 
tcgttttgcc 
ggtttga 



ttctacagcg 
ggaagttgtt 
aggacaggtt 
taaca ttcag 
aggtacggcg 
tagcgcaaca 
tggtgcgaca 
ggcgcgttat 
caaggttcag 



caggttcagg 
gaaaaggaag 
taaatggaat 
aacttgccgg 
taaccggttt 
ggaatgtagc 
taaaaa tccg 
agcgtt gttt 
acggctttat 
tcgggttact 



180 
240 
300 
360 
420 
480 
540 
597 



60 
120 
180 
240 
300 
360- 
420 
480 
540 
549 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
648 



<400> 203 

gtgagtaata aaaacgtcaa tgtaaggaaa tcgcaggaaa taacattctg 
ggtatcctga tgttcatggc aatgatggtt gccggacgcg ctgaagcggg 
ggtgcgactc gcgtaattta tccggcaggg caaaaacaag agcaacttgc 
aatgatgaaa atagtaccta tttaattcaa tcatgggtgg aaaatgccga 
gatggtcgtt ttatcgtgac gcctcctctg tttgcgatga agggaaaaaa 
ttacgtattc ttgatgcaac aaataaccaa ttgccacagg accgggaaag 
atgaacgtta aagcgattcc gtcaatggat aaatcaaaat tgactgagaa 
ctcgcaatta tcagccgcat taaactgtac tatcgcccgg ctaaattagc 
gatcaggccg cagaaaaatt aagatttcgt cgtagcgcga attctctgac 
ccgacaccct attacctgac ggtaacagag ttgaatgccg gaacccgggt 
gcattggtgc ctccaatggg cgaaagcacg gttaaattgc cttctgatgc 



cttgct ggca 
agtggcct ta 
cgtgacaaat 
tggtgt aaag 
agagaatacc 
tttattctgg 
tacgct acag 
gttgccaccc 
gctgattaac 
t ct tgaaaat 
aggaagcaat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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attacttacc gaaca^taaa tgattatggc gcacttaccc ccaaaatgac gggcgtaatg 
gaataa 

<210> 204 
<211> 2637 
<212> DNA 
<213> E . Coli 



720 
726 



- <400 
atgtcatatc 
catcgtttgg 
cctttgtcat 
gtggccgatt 
gatatctatt 
agtgaacaag 
acggcttctg 
atggtccagg 
cctcaggcat 
ggtattaatg 
gggggt aaca 
cgtttacgcg 
aataaatggc 
ctgacgctgg 
gcacaattgg 
at ccacggta 
tataatagta 
agtggtgact 
ccctattcgt 
ggagaatacc 
ctccacggcc 
cgtgctttta 
atgacgcagg 
tttctctata 
tattcgacca 
aacattgaaa 
ctcgcttata 
tcaacactgt 
caatt ccagg 
ctgacgaaaa 
cctttcagcc 
tacagcatgt 
ttgctggaag 
ggaaatagcg 
aatatcggtt 
gtactggctc 
gttaaagcgc 
tggcgtggtt 
gataccaata 
actcgtgggg 
acgctgaccc 
cagagtagcg 
ggaaaagttc 
ctgccaccag 



> 204 
tgaatttaag 
ctggtttttt 
ctgccgacct 
tatcgcgttt 
tgaataatgg 
ggattgttcc 
tcgccggtat 
acgctactgc 
ttatgagtaa 
ccggattgct 
gccattatgc 
acaataccac 
agcatatcaa 
gtgat ggtta 
cctcagatga 
ttgctcgtgg 
cggtgccacc 
tgcaggtaac 
cagtcccgct 
gtagtggaaa 
t tccggctgg 
atttcggtat 
ctaattccac. 
acaaatcgct 
gcggatattt 
cacaggacgg 
acaaacgcgg 
atttgagtgg 
ctggattaaa 
acgcctggca 
actggctgcg 
cacacgatct 
acaacaacct 
gaagtacagg 
acagccatag 
at gccaatgg 
ctggcgcaaa 
atgccgtgct 
ccctggctga 
cgatcgtgcg 
acaataataa 
gcattgttgc 
aggtgaaatg 
agagtcagca 



actttaccag 

tgtccgact c 

ctattttaat 

tgaaaatggg 

tt atatggca 

ctgcct gaca 

gaatctgctg 

gcatct ggat 

tcgcgcgcgt 

caattataat 

atatttaaac 

ctggagttat 

tacctggct t 

tact cagggc 

caatat gtt a 

tactgcacag 

ggggcctttt 

gatcaaagag 

tttgcaacgt 

t gcgcagcag 

ct ggacaata 

cgggaaaaac 

acttcccgat 

caatgaatca 

taatttcgct 

agttattcag 

gaaatt acaa 

tagccatcaa- 

tactgcgttc 

aaaaggacgg 

ttctgacagt 

caacggtcgg 

cagctatagc 

ctacgccacg 

cgatgatatt 

cgtaacgctg 

agatgcaaaa 

gccttatgcc 

taacgt cgat 

agcagagt tt 

gccgct gccg 

ggataatggt 

gggagaagag 

gcagttatta 



cgaaacacac 
gttgtcgcct 
ccgcgctttt 
caagaattac 
acgcgtgatg 
cgcgcgcaac 
gcggatgatg 
gttggtcagc 
ggttatattc 
ttcagcggaa 
ctacagagtg 
aacagtagcg 
gagcgagaca 
gatattttcg 
cccgatagtc 
gtcactatta 
accatcaacg 
gctgacggca 
gaagggcata 
gaaaaaaccc 
tatggtggaa 
atgggggcac 
gacagtcagc 
ggcacgaata 
gatacaacat 
gttaagccga 
ctcaccgtta 
acttattggg 
gaagatatca 
gatcagatgt 
aaatctcagt 
atgaccaatc 
gtgcaaaccg 
ctgaattatc 
aagcagct ct 
gggcagccgt 
gt cgaaaacc 
actgaatatc 
ttagataacg 
aaagcgcgcg 
t ttggggcga 
caggtttacc 
gaaaatgctc 
acccagctat 



aatgcttgca 

gtgcttttgc 

tagcggatga 

cgccagggac 

tcacatttaa 

tcgccagtat 

cctgtgtgcc 

agcgactgaa 

ctcctgagtt 

atagtgtaca 

ggttaaatat 

acagatcatc 

taataccgtt 

atggtattaa 

aaagaggatt 

aacaaaatgg 

atatctatgc 

gcacgcagat 

ctcgttattc 

gctttttcca 

cgcaactggc- 

tgggcgctct 

atgacggaca 

ttcagttagt 

acagt cgaat 

aattcaccga 

ctcagcaact 

gaacgagtaa 

actggacgct 

tagcgcttaa 

ggcgacatgc 

tggctggtgt 

gctatgccgg 

gcggtggtta 

attacggagt 

taaacga'tac 

agacgggggt 

gggaaaatag 

cggttgctaa 

ttgggataaa 

tggtgacatc 

tcagcggaat 

actgtgtcgc 

cagctgaatg 



tattcgtaag 

cgcacaggca 

tccccaggct 

gtatcgcgtc 

tacgggcgac 

ggggctgaat 

attaaccaca 

cctgacgatc 

atgggatccc 

gaatcggatt 

tggtgcgtgg . 

aggtagcaaa 

acgttcccgg 

ctttcgcggc 

tgccccggtg 

gtatgacatt 

cgcaggtaat 

ttttaccgta 

cattacggca 

gagtacatta 

ggatcgttat 

igtctgtggat 

atcggtgcgt * 

gggttaccgt 

gaatggctac 

ctattacaac 

cgggcgcaca 

tgtcgatgag 

cagctatagc 

cgt caatatt 

cagtgccagc 

atacggtacg 

gggaggcgat 

cggcaatgcc 

cagcggtggg 

ggtggtgctt 

gcgtaccgac 

agtggcgctg 

cgttgttccc 

actgctcatg 

agagagtagc 

gcct t tagcg 

caattat caa 

tcgttaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
.2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2637 



<210> 205 
<211> 531 
<212> DNA 
<213> E. Coli 



<400> 205 

atgagaaaca aaccttttta tcttctgtgc gcttttttgt ggctggcggt gagtcacgct 



60 
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ttggctgcgg 
gccgctgaat 
aacattggcg 
gccgtttctg 
cttgcacttg 
cagcaaaatc 
ccgggtaaac 
actgcggggc 



atagcacgat 
caaccaattt 
cgacgactcc 
ccgt aaaggt 
aaaatacggt 
aaat acccct 
caaatacgct 
atat caatgc 



t actatccgc 
tactgttgat 
tgttgttcca 
tgggtttact 
gtcagcggct 
taatgctcca 
gaatttttac 
cacggctacc 



ggctatgtca 
ctgatggaaa 
tttcgtattt 
ggcgttgcag 
tcgggactgg 
tcgtccgcgc 
gcccggcraa 
ttcactcttg 



gggat aacgg 
acgcggcgaa 
tgctgtcacc 
atagccacaa 
gaatacagct 
tttcgtggac 
tggcgacaca 
aatatcagta 



ctgtagtgtg 
gcaatttaac 
ctgtggtaat 
tgccaacctg 
tctgaatgag 
gaccctgacg 
ggtgcctgtc 



<210> 206 
<211> 504 
<212> DNA 
<213> E. Coli 



<400 
atgaaatggt 
atacaggcag 
gtt tccacca 
gccggggcgg 
acgtcgaggg 
caggggaccg 
actggcgcaa 
caggtcagag 
agcatcacct 



> 206 
gcaaacgtgg 
ccgatgtcac 
ccaatgcca'c 
catcggcctg 
tcactgccag 
cgcaaaacat 
ccaaaacagt 
cattgacagt 
at acctacag 



gtatgtatt g 
catcacggtg 
ggtrgatctc 
gcatgatgt t 
cttcagcggg 
ccagttagag 
tcaggtggat 
aaatggcgga 
ct ga 



gcggcaatat 
aacggt aagg 
ggcgatcttt 
gcgcttgagt 
g.cagccgaca 
ctacaggat g 
gattcctcac 
gccactcagg 



tggcgctcgc 
t cgtcgccaa 
attcttt cag 
tgactaattg 
gtaccggata 
acagtggcaa 
aatcagcgca 
gaaccattca 



aagtgcgacg 
accgtgtacg 
tcttatgtct 
tccggtggga 
ttataaaaac 
cacattgaat 
cttcccgtta 
ggcagtgatt 



<210> 207 
<211> 903 
<212> DNA 
<213> E. Coli 



<400 
atgaaacgag 
tcattcgcct 
gtttatgtaa 
acgcaaatct 
cgaggctcgg 
agtagctat c 
gataagccgt 
attaaagctg 
gatgattt cc 
ggctgcgatg 
ccaattcctc 
acaaccgcag 
ggcgtcggcg 
ttaggagcag 
ggagggcagg 
taa 



> 207 
ttattaccct 
gtaaaaccgc 
accttgcgcc 
tttgccataa 
cttatggcgg 
cat ttcctac 
ggccggtggc 
gctcattaat 
agtttgtgtg 
tt tctgctcg 
ttaccgttta 
atgcgggcaa 
tacagttgac 
tagggacttc 
tgactgcagg 



gtttgctgta 
caatggtacc 
cgtcgtgaat 
cgattatccg 
cgtgttatct 
caccagcgaa 
gctttatttg 
tgccgtgctt 
gaatatttac 
tgatgtcacc 
ttgtgcgaaa 
ctcgattttc 
gcgcaacggt 
ggcggtgagt 
gaat gtgcaa 



ctgctgatgg 
gctatcccta 
gtggggcaaa 
gaaaccatta 
aatttttccg 
acgccgcgcg 
acgcctgtga 
attt tgcgac 
gccaat aatg 
gttactctgc 
agccaaaacc 
accaataccg 
acgattattc 
ctgggattaa 
tcgattattg 



gctggtcggt 
ttggcggtgg 
acctggtcgt 
cagactatgt 
ggaccgtaaa 
ttgtttataa 
gcagtgcggg 
agaccaacaa 
atgtggtggt 
cggactaccc 
tggggtatta 
cgtcgttttc 
cagcgaataa 
cggcaaatta 
gcgtgacttt 



aaatgcctgg 
cagcgccaat 
ggatctttcg 
cacact gcaa 
atatagtggc 
ttcgagaacg 
cggggtggcg 
ctataacagc 
gcctactggc 
tggttcagtg 
cctctccggc 
acctgcacag 
cacggt atcg 
tgcacgtacc 
tgttta tcaa 



<210> 208 

<211> 1631 

<212> DNA 

<213> E. Coli 



120 
180 
240 
300 
360 
420 
480 
531 



60 
120 
180 
240 
300 
360 
420 
480 
504 



60 
120 
180 
240 
300 
.360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
903 



<400 
gtgctgtcaa 
catcggggtg 
t cggcgggta 
tgggctgcaa 
ctttatttca 
tcaagactat 
tgtagcctta 



> 208 
aactaccccg 
aggatatgaa 
atgctggttt 
t cggtgtgct 
agattaaaga 
gaactggttg 
caggacaaag 



tagactccga 
atcaatggat 
ctgggcattg 
gggtagcctg 
agacaggcgt 
tgaaaggagt 
aacgctttga 



tcttttcaaa 
aagttaacaa 
cagttactcg 
gtttttggcc 
aaggctgcga 
ccgtaatttt 
cggtgaaatt 



catattgcac 
caggtgttgc 
ataaagtaac 
tgctgacgta 
gaggagagta 
gagaataaag 
tttgacctgg 



catccgtgta 
ctatggcaca 
tccgtcacag 
tctgacaaat 
atccaatgac 
ttacggtaac 
atgtcgccat 



60 
120 
180 
240 
300 
360 
420 
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ggaccgtgtt gaaggSgctg cgctggagtt ttatgaggca 

gcaagtcttc ctggaagtag cagaaaaatt gtcagaaaaa 

tcagtactcc tttaagattg aaaatcctgc caataagcac 

tctatgaaca caaaaatcag atacggcctg tcggctgccg 

ggcgcatctg ctcctcagat acttgaccag tttctggacg 

atggcatacc gcgatggttc tggcatatgg accatctgtc 

ggaaaaaccg tttttcccaa tatgaaactg tcgaaggaaa 

attgagcgtg ataaggcgct ggcatgggtg gagcgcaata 

ccacaaaaag cgggtatcgc gtcattttgt ccctataaca 

ccgtcgacgt tttataagcg gctgaatgct ggtgatcgta 

cgctggtgga ttaaggatgg cggacgcgat tgccgcattc 

caggttattc gtcgtgacca ggagagcgca ttaacctgct 

agatattcat ggtgattttt ctcgtgttgt caggatttat 

accgaggatg gcaaaaaaaa tgggcggaac gtgatgctgc 

atgctcaatt tgctgctcga ataattgaac aggggcgaac 

ttaaagatgc gcaacagaaa tctgctgaaa tttctgccag 

gtgttaacca gttgcgtgcc gaagcaaaaa aatatgccat 

ataccgcaga tcttgccgct gccgtcagag gcaaaacaac 

tcaccaacat gctcggagat attgcagcag aagctcagct 

aacgctacat cgcaggagtg acttgtcaac agatctatga 
atcaaatgta g 



gcagccagaa 
gttgagtct t 
gagcgtcctc 
t tctggcgct 
aaaaagaagg 
ggggtgccac 
aatqcgacca 
ttaaagtacc 
ttggccccgg 
aaggtgcatg 
gt tcaaataa 
gggggataga 
cgtcggaaat 
cgcattatca 
tatagcccgt 
ggctgcttat 
acgccttgac 
caaaaccgcc 
ttatgctgaa 
atctttaaga 



ggagcgtccg 
atctgcagca 
atcataaata 
gattggtgct 
taaccacaca 
agtggt ggat 
ggtcaacgcc 
actgaccgaa 
taagtgtttc 
cgaagcgatt 
ctgttacggt 
acagtgaatc 
gtctggagcg 
caagaggtaa 
ga tgaggctg 
ctgtctgata 
gcagcgaagc 
gaaggaatgc 
attgctgacg 
gataaaaagc 



480 
540 
600 
660. 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1631 



<210> 209 
<211> 534 
<212> DNA 
<213> E. Coli 



<400 
atgaacacaa 
gcatctgctc 
gcat accgcg 
aaaaccgttt 
gagcgtgata 
caaaaagcgg 
tcgacgtttt 
tggtggatta 
gttattcgtc 



> 209 
aaat 
ctca 
atgg 
ttcc 
aggc 
gtat 
ataa 
agga 
gtga 



cagata 
gatact 
ttctgg 
caatat 
gctg'gc 
cgcgtc 
gcggct 
tggcgg 
ccagga 



cggcctgtcg 
t gaccagttt 
catat ggacc 
•gaaactgtcg 
atgggtggag 
attttgtccc 
gaatgctggt 
acgcgattgc 
gagcgcatta 



gctgccgttc 
ctggacgaaa 
atctgtcggg 
aaggaaaaat 
cgcaatatta 
tataacattg 
gatcgtaaag 
cgcattcgtt 
acctgctggg 



tggcgctgat 
aagaaggtaa 
gtgccacagt 
gcgaccaggt 
aagtaccact 
gccccggtaa 
gtgcatgcga. 
caaataactg 
ggatagaaca 



tggtgctggc 
ccacacaatg 
ggtggatgga 
caacgccatt 
gaccgaacca 
gtgtttcccg 
agcgattcgc 
ttacggtcag 
gtga 



60 
120 
18 0 
240 
300 
360 
420 
480 
534 



<210> 210 

<211> 312 

<212> DNA 

<213> E . Coli 

<400> 210 

atgactcaag actatgaact ggttgtgaaa ggagtccgta 

gtaactgtag ccttacagga caaagaacgc tttgacggtg 

gccatggacc gtgttgaagg agctgcgctg gagttttatg 

gtccggcaag tcttcctgga agtagcagaa aaattgtcag 

cagcatcagt actcctttaa gattgaaaat cctgccaata 
aaatatctat ga 

<210> 211 
<211> 291 
<212> DNA 
<213> E . Coli 



attttgagaa 
aaatttttga 
aggcagcagc 
aaaaagtt ga 
agcacgagcg 



taaagttacg 
cctggatgtc 
cagaaggagc 
gtcttatctg 
tcctcatcat 



60 
120 
180 
240 
300 
312 



<400> 211 

gtgctgtcaa aactaccccg tagactccga tcttttcaaa catattgcac catccgtgta 60 

catcggggtg aggatatgaa atcaatggat aagttaacaa caggtgttgc ctatggcaca 120 

tcggcgggta atgctggttt ctgggcattg cagttactcg ataaagtaac tccgtcacag 180 

tgggctgcaa tcggtgtgct " gggtagcctg gtttttggcc tgctgacgta tctgacaaat 240 



-72- 
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ctttatttca agattaaaga agacaggcgt aaggctgcga gaggagagta a 

<210> 212 
<211> 216 
<212> DNA 
<213> E . Coli 



<400> 2i2 

atgtcaaata aaatgactgg tttagtaaaa tggtttaacg ctgataaagg tttcggcttt 

atttctcctg ttgatggtag taaagatgtg tttgtgcatt tttctgcgat tcagaatgat 

aattatcgaa ccttatttga aggtcaaaag gttaccttct ctatagagag tggtgctaaa 

ggtcctgcag cagcaaatgt catcattact gattaa 

<210> 213 

<211> 1017 

<212> DNA 

<213> E. Coli 



PCT/US00/02200 

291 



60 
120 
180 
216 



<400> 213 

atgtttgtca tctggagcca tagaacaggg ttcatcatga gtcatcaact taccttcgcc 60 

gacagtgaat tcagcagtaa gcgccgtcag accagaaaag agat tttctt gtcccgcatg 120 

gagcagattc tgccatggca aaacatggtg gaagtcatcg agccgtttta ccccaaggct 180 

ggtaatggcc ggcgacctta tccgctggaa accatgctac gcattcactg catgcagcat 240 

tggt acaacc tgagcgat.gg cgcgatgg'aa gatgctctgt acgaaatcgc ctccatgcgt 300 

ctgtttgccc ggttatccct ggatagcgcc ttgccggacc gcaccaccat catgaatttc 360 

cgccacctgc tggagcagca tcaactggcc cgccaattgt tcaagaccat caatcgctgg 420 

ctggccgaag caggcgtcat gatgactcaa ggcaccttgg tcgatgccac catcattgag 4 80 

gcacccagct cgaccaagaa caaagagcag caacgcgatc cggagatgca tcagaccaag 540 

aaaggcaatc agtggcactt tggcatgaag gcccacattg gtgtcgatgc caagagtggc 600 

ctgacccaca gcctggtcac caccgcggcc aacgagcatg acctcaatca gctgggtaat 660 

ctgctgcatg gagaggagca atttgtctca gccgatgccg gctaccaagg ggcgccacag 720 

cgcgaggagc tggccgaggt ggatgtggac tggctgatcg ccgagcgccc cggcaaggta 780 

agaaccttga aacagcatcc acgcaagaac aaaacggcca tcaacatcga atacatgaaa 840 

gccagcatcc gggccagggt ggagcaccca tttcgcatca tcaagcgaca gttcggctt c 900 

gtgaaagcca gatacaaggg gttgctgaaa aacgataacc aactggcgat gttattcacg 960 

ctggccaacc tgtttcgggc ggaccaaatg atacgtcagt gggagagatc tcactaa 1017 

<210> 214 
<211> 474 
<212> DNA 
<213> E. Coli 



<40 

atggtatat 
aatct taat 
ttattgaaa 
ggctttggt 
gatgatgat 
acatatatt 
gatgaagcg 
attgtgtca 



0> 214 
a taataa 
ctgacg 
aaatat 
ataata 
acattt 
aatatg 
aatctt 
ttatgt 



tcgt 
atga 
gcca 
ataa 
tgtt 
tcga 
taca 
tagg 



tt cccacgga 
gcactacaag 
gcattatgca 
tattgcggtg 
tttgaatccc 
aagtaagcgt 
tgattattcc 
ga ttaaggaa 



catgaagact 
attatcgtac 
ggcctggact 
gcgtatgtaa 
gatatcatca 
tatgctttta 
gt aagaaaat 
ggtgcgaaca 



acatcaaaaa 
gcgacaacaa 
atattagtgg 
aggaaaaata 
tgaagcat ga 
gtacattatg 
ttcctgtgct 
agtccctgat 



attactcgaa 
agact ctcta 
aggtgtatac 
tagacccgca 
tgatttgctg 
cctgttccga 
ttctgatttt 
atga 



<210> 215 
<211> 1119 
<212> DNA 
<213> E . Coli 

<400> 215 

atgggaaaaa gcatagtcgt tgtttctgcg gtcaatttta ccactggcgg tccatttacc 
attttgaaaa aatttttggc agcaactaat aataaagaaa atgtcagttt tatcgcatta 
gtccattctg ctaaagagtt aaaagaaagt tatccatggg ttaaattcat tgagt ttcct 



60 
120 
180 
240 
300 
360 
420 
474 



60 
120 
180 
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gaggttaaag 
tcaaaagagc 
actaaaaaaa 
cgtgaaattc 
aaaataaaca 
tttatcaaga 
gataaaagcc 
atattttacc 
aggaaattga 
gcgtatgcaa 
tacttggata 
tctaggttag 
ttagcatcag 
tttgattcta 
ctcaaaaaag 
tttgatgaac 



ggtcgtggct 

t gaatgctac 

•gatatgtgta 

t tatggagcc 

ttaaaaaaaa 

aatattctat 

aacttactga 

ctgctgttcc 

aagaacaat c 

aatatattat 

aagaaaaaat 

aaacatgggg 

atttcccatt . 

ataacgatga 

atatct ctga 

tagttaattt 



aaaacgt ttg 
gcattggatt 
t tgtcataac 
tagctttttc 
tactgcagtg 
aaataacatc 
tgatgattct 
acgagtattt 
caatattaaa 
cagtcttgca 
cgatcattgt 
attgccgttg 
tactagagaa 
catgttagtt 
tgcaaatttc 
tattactgaa 



cactt.tgaat 
tgtctgcatg 
cctgcacctt 
ttatt taaaa 
tttgttcaac 
attgtcaqtc 
caatttaaga 
aaaaatta'cg 
tttctgctta 
gaaggactgg 
tataatattt 
tctgaagcta 
actcttggta 
aaacttatta 
atttatcgta 
gaacattga 



atgtagtttg 
at at t acggc 
tttat aaagg 
tgctat acgg 
aatt ctggat 
ggccagaaat 
ataacccttc 
agcttattat 
ctatcagtgg 
ataatgttca 
cagatatagt 
aagagcgagg 
gttat gaaaa 
ttgacttcaa 
atgaaaatgt 



taaaa-aactt 
caatgt cgtc 
aattttattc 
gctgatatat 
gaaagaaaaa 
taaattatct 
tgagtt gaca 
tagtgcagca 
tacagaaaat 
tttcct cggg 
ttgttttccc 
taagtgggta 
gaaagctttt 
aaaaggtaac 
attagt tggg 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
.1080 
1119 



<210> 216 
<211> 591 
<212> .DNA 
<213> E. Coli 



<400 
atgatcttaa 
gtcttattga 
cgcaatgatg 
gatgcatttg 
catatcgcct 
tttattaccg 
atacct ccag 
ttgggtgaga 
gccaattctg 
aaaatcataa 



> 216 
aactcgctaa 
ctcgtgtatt 
gtagcattaa 
gacgtggcgt 
caat tgagag 
atcataatca 
acatgcgcac 
atgtgacggt 
ttgttagagg 
agaaatacaa 



<210> 217 
<211> 993 
<212> DNA 
<213> E . Coli 



acgatatggt 
tt accggaac 
ttttggtgaa 
gatttttttt 
cgttacgata 
cggttccttt 
gttggaat ct 
tttgcctgga 
ttctattccc. 
tcatgagacc 



ctct gtggtt 
t gtagaatta 
aatttcacaa 
tccgataatg 
ggtcgggata 
aagcactctg 
tcagctgttg 
acaattattg 
gaaaatactg 
aaattatggg 



ttatt cggct 
ttcgatttcc 
gtggagtcgg 
tgcaagttaa 
cgcttattgc 
atccaat gag 
taatt ggcca 
gtaatggagt 
tcattgcggg 
aaaaagcata 



tgt tagagat 
ctgctatatt 
tctcaggctg 
cgactatgtt 
aagtaaagta 
ttcgccaaat 
gagggtttgg 
cgtagt cggc 
agtaccagca 

g 



60 
120 
180 
240 
300 
360 
420 
480 
540 
591 



<400 
atgtattttt 
gatgcactgg 
ggtggagtag 
gaaaataaag 
tcattctttc 
ttaagaggag 
cacaat ccac 
ataaaaatat 
caacgagggg 
gaaggatgcg 
cttggaagtt 
ctcat t tggg 
• tttaataacc 
gataaagccg 
atcaaagaaa 
gagaatacaa 
gaagaggtga 



> 217 
tgaatgattt 
acattgcttc 
tccagagaat 
atgttttaat 
accgccttct 
gagggggtag 
aaatgacaaa 
ttgatt acct 
tcatatatgc 
attttactct 
ttgatgct ca 
atggagatt c 
ct ca taagac 
ccct tgcgga 
tgcaagaga t 
aaattatttc 
tcgatgatct 



aaatttctct 
agattatgaa 
tattagttct 
tttcaatttc 
aaaatttaga 
tgattctgtg 
gtaccttagt 
cgtctcatct 
tggcaacctt 
ctttggtgtc 
atctccggaa 
tgtcgaaacc 
atctctttat 
tttcattgta 
tgttgactcc 
tcagaaaat t 
taaaactcgc 



agacgcgatg 
aacatttctg 
gttaagct ta 
ccgatggcca 
atagtacctc 
cggcttgcta 
aaat-at atgt 
gatgtggagc 
tctaggcata 
aactatgaaa 
aagattaacc 
tgtagtggtg 
cttt caatgg 
gataatagaa 
atgacaatag 
cgaacaggaa 
taa 



ctgga tt t aa 
t tgttaacat 
gtacatttct 
aaccattttg 
tgatt catga 
cctgtgatat 
ctcaggataa 
atcgaga tgt 
aatgttcttt 
ataaagataa 
t cccaggcat 
cct t tggcga 
aact tccagt 
taggatatgc 
aaactt ataa 
gttacttcag 



agcaagaaaa 
tcctctatgg 
ctgcggtctt 
gcatatattg 
tattgatgaa 
ggtcataagt 
aatcaaagac 
tacggataag 
catatatact 
t cctaaatat 
gcaatttgga 
ctattt aaag 
atttatatgg 
agtgggatca 
gcaaattagt 
ggatgttctt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
.780 
840 
900 
960 
993 



<210> 218 
<211> 1167 
<212> DNA 
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<400 
atgatctatc 
aaggatatat 
tatgaaataa 
ctttgcaatg 
aatatcagaa 
■ ggcttgttag 
ttcgggacta 
tcaagaaatt 
tggtctaaaa 
gtattcatct 
tcttatgcat 
gtaggtgttc 
tattatctat 
caagtatcta 
tt aacaggtg 
aatgtttata 
atggttattc 
gtgaaaatat 
ttcatgacta 
tttcttaagg 



> 218 
ttgtaattag 
tttatccagc 
cgtcagatat 
tttrgacatt 
aagtcaataa 
ttatttcttt 
gcttacttag 
tctcagcata 
aatttactaa 
ttgcaattat 
tcatcgtagg 
tattctcctt 
ccatgtattt 
actctgccag 
gagtctctat 
ctgctttttc 
atggctgtat 
tttattcata 
atattagcag 
cccagaaaat 



tgtctttctc. 

cgtatgcgtt 

atatgctttt 

taccctgtca 

tgctatttat 

ttcgatgata 

ctatat gaat 

catgcagcca 

tacaaaggta 

actgaatact 

tgttaataga 

gtatatgctc 

ggtcagccct 

ttctcatgtc 

gtcg-ttgcat 

ggattatgtt 

ttcaggtgtt 

ttttatttat 

ttggatacaa 

aaagtga 



attacagcat 
aatatcatct 
cagttaaat q 
tgtttattga 
agcataccat 
tatat atgca 
ttgataagag 
atcattctaa 
agtaaaacat 
ggtaagcaaa 
gtaaaacatt 
tttttacgtg 
ataatcgcgt 
ttttggtttt 
aaagaatttg 
tatatttccg 
ttatggaga t 
accttttctt 
ataactcttt 



ttatctgt.tt 
tcgcactggt 
acgctacgtt 
cggaaagtgt 
cgaagaaagt 
tgaggtt aag 
atgctgatgt 
ctacttttgc 
ttactttact 
ttgtctttat 
atgtttatct 
gactgcctgg 
t tcaggagtt 
t tgaaaggct 
tgtgggtggg 
cggagctaag 
t gtctcgaaa 
tcatttttta 
gtatcatagt 



atatcttaag 
cttattggga 
gatttttcta 
attagatcta 
gcataatgta 
taactaccag 
tgaagacaca 
tttatttatt 
tgtttttatt 
ggttatcatc 
tat tacagct 
ggggatggca 
ttatttt cag 
gatggggcta 
tttgccaaca 
ctatttgatg 
ttacatatct 
tcatgaaagc 
attctctcaa 



<210> 219 
<211> 1104 
<212> DNA 
<213> E. Coli 



<400 
atgtacgatt 
aaaaagctaa 
tacacagagg 
aatgataaat 
tctccactgg 
caccaaatgt 
aaaaagtatg 
gaggacttat 
aaagaattgc 
tatttttccg 
atgcttgaag 
gcgagtaaag 
tttggagcgt 
ttccaaggga 
gagcataaac 
ttagagtgga 
ctttttaaga 
ttggccgagt 
aaaaatataa 



> 219 
atatcattgt 
acaaaaaagt 
actgtgaggg 
atatatggga 
cgatttataa 
ggggagttaa 
gtgacaaggt 
accaagcatt 
ctgcatttat 
atcgctatca 
gtgtggacgt 
cccatagaat 
t agaatatcg 
atgcagtaat 
attttgacta 
aagttggcga 
aatatagaga 
ataaatatta 
tgagtacgga 



tggttctggt 
•tttagtgatt 
tatccagatt 
ttacgttaat 
agacaaatta 
agatcctcaa 
acctgaaaat 
gataaagggt 
t attaagcga 
aggr a ttccg 
aaaattaggc 
catctacact 
ctctt taaaa 
aaattt cact 
tgttgagaca 
cgaaccctac 
gttagctagc 
tgatatgcat 
ttaa 



ttgtttggtg 
gagaaaagaa 
cataaatatg 
gatttagtag 
ttcaaccttc 
gaagct caaa 
ttggaggagc 
tatacggaga 
atcccagtga 
gtgggaggct 
attgattttt 
ggacccattg 
tt tgagacgg 
gatgctaatg 
aagcatacgg- 
tatccagtta 
agagaagaca 
caagtgatat 



ccgtttgtgc 
atcatatcgg 
gt gcacatat 
aatttaatcg 
cttttaatat 
atatcattaa 
aggcgatttc 
agcagtgggg 
gatttacgtt 
acactaagct 
tgaaagacaa 
atcagtactt 
aacgccatga 
taccatatac 
tt gttacaaa 
atgataataa 
aggttatatt 
ctgccgctct 



gaatgagtta 
tggaaatgcg 
ttttcatacc 
ttttactaat 
gaatactttc 
tgctcagaaa 
attagt tggg 
aagaagtgca 
tgataacaat 
tattgaaaaa 
agattctcta 
cgactatagg 
atttccaaac 
cagaat aatt 
agaata tcca 
aaacat ggag 
tggcgggcgt 
ttatcaagtg 



<210> 220 
<211> 1116 
<212> DNA 
<213> E . Coli 

<400> 220 

atgttcccaa aaataatgaa tgatgaaaac tttttcaaaa aagcggcggc gcacggggag 

gaacctcctt taactcctca aaacgaacat cagcggtccg ggctgcgctt cgcccgtcgc 

gtcagactac cccgtgcggt tggcctggct ggcatgttct taccgattgc ttcaacgctg 

gtttcacacc cgccgccggg ctggtggtgg ctggtgttgg tcggctgggc gttcgtctgg 

ccgcatttag cctggcagat agcgagcag.g gccgtcgatc cgcttagccg ggaaatttac 



60 
120 
180 
240 
300 
360 
.420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1167 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1104 



60 
120 
180 
240 
300 
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aacttaaaaa ccgal^agt attagcggga atgtgggtag 

ctgccttcca ccgcgatgtt gatgattatg tgtctgaatt 

cgtctgtttg tcgcgggtct ggtgttgatg gtggtttcct 

acgggcatta ccgtgtcgtt caatagtgcg ccgctggaat- 

attgtcattt atcctctgct gtttggctgg gtcagctacc 

gaacataaac gcaggttgca ggtcatgagt acccgcgacg 

cgacgtcatt gggaaactat gttacgcaat gaatttgata 

gatgcaacgt tactgattat cgatatcgac catttcaaga 

catgatgtgg gcgatgaagc gattgtggcg cttacccgac 

ggtagcgatg tgattggtcg gtttggcggc gatgagtttg 

ccagctgaga gcgccattac cgccatgtta cgggtgcatg 

ttgccgaata cgccacaggt aactttacgg attagtgtgg 

caaatgagtc actatcgtga gtggttgaaa tcggcagatt 

aaagccggac gtaaccgcac cgaagtggcg gcctga 

<210> 221 
• <211> 1404 

<212> DRA . . 
<213> E. Coli 



gcgtaatggg 
tgatgggggc 
gccttgtcac 
ggtggctctc 
agacggcaac 
gcatgacggg 
actgtcggcg 
gcat caacga 
agttacaaat 
cagtaatcat 
aagggct aaa 
gggttgcgcc 
tggcgctt ta 



cgtaaacgtg 
aggcggcccc 
cctcgagctg 
cctt cccatt 
caaactggcg 
cgtgtataac 
gcataatcgc 
tacctggggc 
taccctgcgc 
gtccggtacg 
tacattacgt 
gctgaaccca 
caaagcaaag 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1116 



<400 
t tggatgt ga 
accagcggca 
agcagcgatt 
tctaactacg 
aacggcgacg 
gcaaccggta 
tacaaagata 
gctaacaaag 
gttctgcaac 
aacaccaata 
catggcctgg 
gacaacggca 
aaaattgacg 
gacatgaatg 
tctgct cact 
aacgacctgt 
gcttggggct 
actccttacg 
gacatgaaag 
ggttatacct 
gtctacgacg 
gaagggtctg 
gcctataccg 
gtgggtgtta 



> 221 
acgttgatca 
acatcgctga 
acgttctgaa 
gttacggtat 
tagacaacgg 
actacaaagt 
aagaaattat 
ct gacctggg 
agatggagct 
tctggaacct 
ctgataacgg . 
ccatcaacta 
gtaacaacgc 
accgttctgg 
tcgcgaacaa 
ctgcaaccat 
tcggtttgaa 
gcagcgtttc 
ttgacggtca 
tcacctacag 
actctaacaa 
cggtacgtgt 
atgctaacta 
aatatacctg 



gttcgatact 
ccataacggt 
cgctgatct g 
tgttgctatg 
tactgaactg 
tcgtatcgac 
ct acgtaaac 
t gcatacacc 
gaccgactac 
ggaacaagac 
cggcgcatgg 
tgatcaggat 
t aagtggatc 
tcaggtggat 
cgtctttgtt 
gagcaacggt 
agccggttac 
tggtctgttc 
gt ctt acgac 
cgaagatcag 
cgataacgat 
tggtctgggt 
cctcggtggt 
gtaa 



gaagctttcc 
aacgtagtat 
gtgaacgacc 
aactctgatg 
gataacagct 
aacgcaactg 
gacgtcaaca 
tatcaggctg 
gctaacatgg 
accgttggta 
gtaagctact 
gttaacggca 
gt cggtgcgg 
caagacagcc 
gatggtagct 
acttacgttg 
gacttcaaac 
cagtctggtg 
agcatgcgtt 
gctctgactc 
gtgaacggcg 
actcagttta 
ggtgacgtag 



gt actgacaa 
ctggtgtgtt 
gtacctggga 
gt cacctgac 
ct gtagacaa 
gcgctggcgc 
gcaacgcgac 
aacagcgcgg 
cgctgagcat 
ctcgt ctgac 
tcggtggtaa 
tcatggtcgg 
ctgcaggctt 
agactgccta 
tgagctactc 
acggtagcac 
tgggtgatgc 
atgactacca 
atgaactggg 
cgtacttcaa 
attccat cga 
gcttcaccaa 
at caagactg 



act ggaactg 

cgatatccat 

tacttccaag 

tatcaacggt 

tgttgttgct 

tat cgctgat 

cttctct get 

taacaccgtt 

cccgtctgcg 

caactct cgt 

ctt caaegge 

tgttgatacc 

cgctaaaggt 

catctactct ■ 

tcacttcaac 

caact ccgac 

tggttacgtg 

gctgagcaac 

tgtagatgea 

actggcttac 

taacggt act 

gaact tcagc 

gtccgcgaac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1404 



<210> 222 
<211> 669 
<212> DNA 
<213> E . Coli 



<400 
atgcccgtca 
aaacctcttc 
gagtttaata 
gttattttag 
caggctcctt 
ttaatatcag 
attgaacaaa 
ctggaattac 
ttattatcaa 



> 222 
aggatttgac 
aggaatttgg 
atgaaaagca 
agggagttat 
atattatggg 
aaggaaattg 
atcaactctg 
gcgacgtgca 
tgattgactg 



gggcattact 
taagctcgat 
agttatattt 
ctctctgcgt 
gctggctgat 
tacgggatat 
gegagacget 
gctcat tggg 
gaatgaagaa 



gcaaaggacg 
aaatgtttgt 
tccagtgatg 
agagaagaaa 
ggtttaatga 
catctaccag 
ttttactggt 
cataattcct 
t t gcgatcac 



cgcaaatgt t 

ecagataegg 

t caat aacga 

aegtacttat 

aaaacgatat 

ccaaacaaac 

tagectggea ■ 

acgaacaaat 

gtattggtgt 



atctgtagtt 
tacgcgcttc 
agatact tt c 
eggtatt acc 
accatacaaa 
cattacgett 
aaatagaatt 
ccgcgcaaca 
gatgaat tat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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atccatcaac gtacacgcat atcgcgttct gtcgtcgcag aagttctcgc tgctttgcgt 600 
aaaggcggct atatcgaaat gaataaaggc aaactggtcg ctatcaaccg tttgccttca 660 
gagtattaa 669 

<210> 223 

<211> 255 

<212> DNA 

<213> E. Coli 

<400> 223 

atgaccgata aaatccgtac tctgcaaggt cgcgttgtta gcgacaaaat ggagaaatcc 60 

attgttgttg ctatcgaacg ttttgtgaaa cacccgatct acggtaaatt catcaagcgt 120 

acgaccaaac tgcacgtaca tgacgagaac aacgaatgcg gtatcggtga cgtggttgaa 180 

atccgcgaat gccgtccgct gtccaagact aaatcctgga cgctggttcg cgttgtagag 240 

aaagcggttc tgtaa ~ " 255 

<210> 224 
<211> 192 
<212> DNA 
<213>. E. Coli 

<400> 224 

atgaaagcaa aagagctgcg tgagaagagc gttgaagagc tgaacaccga gctgctgaac 60 

ctgctgcgtg agcagttcaa cctgcgtatg caggctgcaa gtggccagct gcaacagtct 120 

cacctgttga agcaagtgcg tcgcgatgtc gcacgcgtta agactttact gaacgagaag 180 

gcgggtgcgt aa " " 192 

<210> 225 
<211> 411 
<212> DNA ' 
<213> E. Coli 



<400> 225 

atgttacaac * caaagcgtac aaaattccgt aaaatgcaca aaggccgtaa 
gcgcagggta cggatgttag cttcggcagc ttcggtctga aagctgttgg 
ctgactgccc gtcagatcga agcagcacgt cgtgctatga cccgtgcagt 
ggtaagatct ggatccgtgt gttcccggac aaaccgatca ctgaaaagcc 
cgtatgggta aaggtaaagg taacgtggag tattgggttg ccttgattca 
gtcctgtatg aaatggacgg tgttccggaa gagctggccc gtgaagcatt 
gcagcgaaac tgccgattaa aaccaccttt gtaactaaga cggtgatgta 

<210> 226 
<211> 702 
<212> DNA 
<213> E. Coli 



ccgcggtctg 
ccgtggtcgt 
taagcgtcaa 
gctggcagtg 
gccgggtaaa 
caagctggca 
a 



60 
120 
180 
240 
300 
360 
411 



<400 
atgggt caga 
acctggtt-tg 
cagtacctga 
gctaagagca 
ggtgaagacg 
•atcaacatcg 
acttctcagc 
gcaatgcgtc 
gaaatcgcac 
gacatcgact 
tggatcttca 
gctgctcagc 



> 226 
aagtacat cc 
cgaacaccaa 
ctaaggaact 
tccgtgtaac 
t agaaaaact 
ccgaagttcg 
tggaacgtcg 
tgggcgctaa 
gtaccgaatg 
acaacacctc 
aaggcgagat 
ctaaaaagca 



taatggtatt 
agaattcgct 
ggctaaagcg 
cattcacact 
gcgtaaggtc 
taagcctgaa 
cgttatgttc 
aggtattaaa 
gtaccgcgaa 
tgaagcgcac 
cctgggtggt 
gcagcgtaaa 



cgcctgggta 
gacaacctgg 
tccgtatctc 
gctcgcccgg 
gtagcggaca 
ctggacgcaa 
cgtcgtgcta 
gttgaagtta 
ggtcgcgtac 
accacttacg 
atggctgctg 
ggccgtaaat 



ttgtaaaacc 
acagcgattt 
gtatcgttat 
gtatcgttat 
tcgctggcgt 
aactggttgc 
tgaagcgtgc 
gcggccgtct 
cgctgcacac 
gtgtaat egg 
ttgaacaacc 
aa 



atggaact ct 
taaagtacgt 
cgagcgtccg 
eggtaaaaaa 
tcctgcacag 
tgacagcatc 
t gtacagaac 
gggcggcgcg 
tetgegtget 
cgttaaagtg 
ggaaaaaccg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
702 



<210> 227 
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<400> 227 

atggaaacta tcgctaaaca tcgccatgct cgttcttctg ctcagaaggt tcgccttgtt 60 

gctgacctga ttcgcggtaa gaaagtgtcg caggctctgg atattttgac ctacaccaac ' 120 

aagaaagcgg ctgtactggt caagaaagtt ctggaatctg ccattgctaa cgctgaacac 180 

aacgatggcg ctgacattga cgatctgaaa gttacgaaaa ttttcgtaga cgaaggcccg 240 

agcatgaagc gcattatgcc gcgtgcaaaa ggtcgtgcag atcgcatcct gaagcgcacc 300 

agccacatca ctgtggttgt gtccgatcgc tga 333 

<210> 228 
<211> 279 
<212> DNA 
<213> E. Coli 

<400> 228 

atgccacgtt ctctcaagaa aggtcctttt attgacctgc acttgctgaa gaaggtagag 60 

aaagcggtgg aaagcggaga caagaagccc ctgcgcactt ggtcccgtcg ttcaacgatc 120 

tttcctaaca tgatcggttt gaccatcgct gtccataatg gtcgtcagca cgttccggta 180 

tttgtaaccg acgaaatggt tggtcacaaa ctgggtgaat tcgcaccgac tcgtacttat 240 

cgcggccacg ctgctgataa aaaagcgaag aagaaataa 279 

<210> 229 

<211> 822 

<212> DNA 

<213> E. Coli 



<400> 229 

atggcagttg ttaaatgtaa accgacatct. ccgggtcgtc^gccacgtagt taaagtggtt 

aaccctg-agc tgcacaaggg .caaacctttt gctccgttgc tggaaaaaaa cagcaaatcc 

ggtggtcgta acaacaatgg ccgtatcacc actcgtcata tcggtggtgg ccacaagcag 

gcttaccgta ttgttgactt caaacgcaac aaagacggta tcccggcagt tgttgaacgt 

cttgagtacg atccgaaccg ttccgcgaac atcgcgctgg ttctgtacaa agacggtgaa 

cgccgttaca tcctggcccc taaaggcctg aaagctggcg accagattca gtctggcgtt 

gatgctgcaa tcaaaccagg taacaccctg ccgatgcgca acatcccggt tggttctact 

gttcataacg tagaaatgaa accaggtaaa ggcggtcagc tggcacgttc cgctggtact 

tacgttcaga tcgttgctcg tgatggtgct tatgtcaccc tgcgtctgcg ttctggtgaa 

atgcgtaaag tagaagcaga ctgccgtgca actctgggcg aagttggcaa tgctgagcat 

atgctgcgcg ttctgggtaa agcaggtgct gcacgctggc gtggtgttcg tccgaccgtt 

cgcggtaccg cgatgaaccc ggtagaccac ccacatggtg gtggtgaagg tcgtaacttt 

gg-taagcacc cggtaactcc gtggggcgtt cagaccaaag gtaagaagac ccgcagcaac 

aagcgtactg ataaattcat cgtacgtcgc ' cgtagcaaat aa 

<210> 230 
<211> 303 
<212> DNA 
<213> E. Coli 

<400> 230 

atgattcgtg aagaacgtct gctgaaggtg ctgcgtgcac cgcacgtttc tgaaaaagcg 

tctactgcga tggaaaaatc caacaccatc gtactcaaag ttgctaaaga cgcgaccaaa 

gcagaaatca aagctgctgt gcagaaactg tttgaagtcg aagtcgaagt cgttaacacc 

ctggtagtta aagggaaagt taaacgtcac ggacagcgta tcggtcgtcg tagcgactgg 

aaaaaagctt acgtcaccct gaaagaaggc cagaatctgg acttcgttgg cggcgctgag 
taa " 

<210> 231 
<211> 630 
<212> DNA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
822 



60 
120 
180 
240 
300 
303 
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<213> E. Coli 
<400> 231 

atgattggtt tagtcggtaa aaaagtgggt atgacccgta tcttcacaga agacggcgtt 60 

tctatcccag taaccgtaat cgaagttgaa gcaaaccgcg ttactcaggt taaagacctg 120 

gctaacgatg gctaccgtgc tattcaggtg accaccggtg ctaaaaaagc taaccgtgtg 180 

accaagcctg aagcrggcca cttcgctaaa gctggcgtag aaqcrggccg tggtctgtgg 240 

gaattccgcc tggctgaagg cgaagagttc actgtaggtc agagcattag cgttgaactg 300 

tttgctgacg ttaaaaaagt tgacgtaact ggcacctcta aaggtaaagg tttcgcaggt 360 

accgttaagc gctggaactt ccgtacccag" gacgctactc acggtaactc cttgtctcac 420 

cgcgttccgg gttctatcgg tcagaaccag actccgggca aagtgttcaa aggcaagaaa 480 

atggcaggtc agatgggtaa cgaacgtgta accgttcaga gccttgacgt agtacgcgtt 540 

gacgctgagc gcaacctgct gctggttaaa ggtgctgtcc cgggtgcaac cggtagcgac 600 

ctgatcgtta aaccagctgt gaaggcgtaa 630 

<210> 232 

<211> 606 

<212> DNA 

<213> E . Coli 

<400> 232 

atggaattag tattgaaaga cgcgcagagc gcgctgactg tttccgaaac taccttcggt 60 

cgtgatttca acgaagcgct ggttcaccag gttgttgttg cttatgcagc tggtgctcgt 120 

cagggtactc gtgctcagaa gactcgtgct gaagtaactg gttccggtaa aaaaccgtgg 180 

cgccagaaag gcaccggccg tgcgcgttct ggttctatca agagcccgat ctggcgttct « 240 

ggtggcgtga cctttgctgc tcgtccgcag gaccacagtc aaaaagttaa caagaagatg 300 

taccgcggcg cgctgaaaag catcctgtcc gaactggtac gtcaggatcg tctgatcgtt 360 

gtcgagaagt tctctgtaga agcgccgaaa actaagctgc tggcacagaa actgaaagac 420 

atggctctgg aagatgtgct gatcatcacc ggtgagctgg acgaaaacct gttcctggct 480 

gcgcgcaacc tgcacaaggt tgacgtacgc gatgcaactg gtatcgaccc ggttagcctg 540 

atcgccttcg acaaagtcgt aatgactgct gatgctgtta agcaagttga ggagatgctg 600 

gcatga 606 

<210> 233 
<211> 312 
<212> DNA 
<213> E . Coli 

<400> 233 



atcat cgt ct 


gatcgatcaa 


60 


cgcaggt ccg 


tggtccgatc 


120 


ccccgcacgt 


caacaaagac 


180 


ttgacat cgt 


tgagccaacc 


240 


ccggtgtaga 


cgtgcagatc 


300 






312 



agcctgggtt aa 

<210> 234 

<211> 357 

<212> DNA 

<213> E . Coli 

<400> 234 

atggctcgcg taaaacgtgg tgttattgca cgtgcacgtc acaagaaaat tttgaaacaa 60 

gctaaaggct actacggtgc gcgttctcgc gtataccgcg ttgccttcca ggctgttatc 120 

aaagctggtc agtatgctta ccgtgaccgt cgtcaacgta agcgtcagtt ccgtcaactg 180 

tggattgcgc gtatcaacgc agcagcacgt cagaacggta tttcttacag caaattcatc 240 

aatggcctga aaaaagcctc tgttgaaatc gaccgtaaga tcctggctga tatcgcagta 300 
ttcgacaaag tagcgttcac cgctctggtt gaaaaagcga aagcagctct ggcataa • • 357 

<210> 235 
<211> 198 
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<400> 235 

atgccaaaaa ttaagaccgt acgcggtgct gctaagcgct tcaaaaaaac cggtaaaggt 
ggttttaagc acaagcacgc taacctgcgt cacattctga ccaaaaaagc gaccaaacgt 
aaacgtcacc tgcgtccgaa agccatggtt tccaaaggcg atctgggcct ggtaatcgcg 
tgcctgccgt acgcataa 

<210> 236 
<211> 543 
<212> DNA 
<213> E . Coli 



60 
120 
180 
198 



<400 
at taaaggcg 
gcccaggaag 
gaagctctgg 
gagccgccgg 
t ct aaggaac 
ggtacagatg 
ggtgataaag 
ggtatggaag 
ttcccaacga 
taa 



> 236 
gaaaacgagt 
ttcgcttaac 
agaaagcaga 
tttgtcgtat 
agaagaaaaa 
aaggcgacta 
ccaaaatcac 
tgcttaatcg 
agatcgaagg 



tcaaacggcg 
aggtctggaa 
agaagccgga 
aatggattac 
gcaaaaagtt 
tcaggtaaaa 
gctgcgtttc 
cgtgaaagac 
ccgccagatg 



cgccctaacc 
ggcgagcagc 
gtagacttag 
ggcaaattcc 
atccaggtt a 
ctccgcagcc 
cgcggtcgtg 
gatttgeaag 
atcatggtgc 



gt atcaatgg 
ttggtattgt 
tcgagat cag 
t ctatgaaaa 
aggaaattaa 
tgattcgctt 
agatggcgca 
aactggcagt 
t cgctcctaa 



cgaaat 
gagtct 
ccctaa 
gagcaa 
attccg 
tctcga 
ccagca 
ggt cga 
gaagaa 



tcgc 
gaga 
cgcc 
gtct 
tcct 
agag 
aatc 
atcc 
acag 



<210> 237 
<211> 1929 
<212> DNA 
<213> E . Coli 



60 
120 
180 
240 
300 
360 
420 
4 80 
540 
543 



<400 
atgcctgtta 
atggatgttg 
aatggcgaac 
accgccaaag 
cacgcgatta 
ggtttttatt 
gagaagcgga 
tggcacgaag 
gacgaaaaca 
atgtgccgcg 
acggcagggg 
acggcgtggg 
aaacgcgacc 
ccgggtatgg 
gttcgttcta 
cgtgtcctgt 
tcttctgaga 
ttcaaccagg 
tgccaccgta 
caggatgacg 
atccgtttag 
tccactcgtc 
ctggcggttg 
ttctacggtc 
ggtacagtac 
gacaatgaac 
tt catcggta 
caggttgtta 
aaactatcaa 



> 237 
taactcttcc 
cgctggacat 
tggttgatgc 
acgaagaagg 
aacaacttt g 
acgacgttga 
tgcatgagct 
cgcgtgaaac 
tcgcccatga 
gtccgcacgt 
cttactggcg 
cagacaaaaa 
accgtaaaat 
tattctggca 
aactgaaaga 
gggaaaaaac 
accgtgaata 
ggctgaagtc 
acgagccgtc 
cgcatatctt 
tctatgatat 
ctgaaaaacg 
cgctggaaga 
cgaaaat tga 
agctggact.t 
gtaaagtacc 
tcctgaccga 
tcatgaatat • 
atgcgggcat 



tgatggcagc 
tggtccaggt 
ttgcgatctg 
tctggagatc 
gccgcatacc 
tcttgaccgc 
tgctgagaaa 
tttcgccaac 
tgacaagcca 
accgaacatg 
t ggcgacagc 
agcacttaac 
cggtaaacag 
caacgacggc 
gtaccagtat 
cggtcactgg 
ctgcattaag 
ttatcgcgat 
aggttcgctg 
ctgtactgaa 
gtacagcact 
tattggcagc 
aaacaacatc 
atttaccctg 
ctctttgccg 
ggtaatgatt 
agagttcgct 
taccgatt ca 
tcgtgttaaa 



caacgccatt 

ctggcgaaag 

attgaaaacg 

attcgtcact 

aaaatggcaa 

acgttaaccc 

aactacgacg 

cgtggggaga 

ggtctgtact 

cgtttctgcc 

aacaacaaaa 

gcttacctgc . 

ctcgacctgt 

tggaccatct 

caggaagtta 

gacaactaca 

ccgatgaact 

ctgccgctgc 

catggcctga 

gaacaaattc 

tttggcttcg 

gacgaaatgt 

ccgtttgaat 

tatgactgcc 

tctcgtctga 

caccgcgcaa 

ggtttcttcc 

cagtctgaat 

gcagacttga 



acgatcacgc 
cctgtatcgc 
acgcacaact 
cctgtgcgca 
tcggcccggt 
aggaagatgt 
tcattaagaa 
gctacaaagt 
tccatgaaga 
atcatttcaa 
tgttgcaacg 
agcgcctgga 
accatatgca 
tccgtgaact 
aaggtccgtt 
aagatgcaat 
gcccgggtca 
gt atggccga 
tgcgcgtgcg 
gcgatgaagt 
agaagatcgt 
gggatcgtgc 
at caactggg 
tcgatcgtgc 
gcgcttctta 
ttctggggtc 
cgacctggct 
acgttaacga 
gaaatgagaa 



tgtaagcccc 
agggcgcgtt 
gtcgat catt 
cctgtt aggg 
tattgacaac 
cgaagcactc 
gaaagt cage 
ctccattctt 
at atgtcgat 
actaatgaaa 
tatttaeggt 
agaagecgeg 
ggaagaagcg 
ggaagtgttt 
catgatggac 
gttcaccaca 
cgtacaaatt 
gtttggtagc 
tggatttacc 
taacggatgt 
cgt caaactc 
tgaggeggae 
tgaaggeget 
atggcagtgc 
tgt aggegaa 
gatggaacgt 
tgcgccggtt 
attgaegcaa 
gattggcttt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
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aaaatccgcg agcacacttt gcgtcgcgtc ccatatatgc tggtctgtgg tgataaagag 

gtggaatcag gcaaagttgc cgttcgcacc cgccgtggta aagacctggg aagcatggac 

gtaaatgaag tgatcgagaa gctgcaacaa gagattcgca gccgcagtct taaacaattg 
gaggaataa 



<210> 238 
<211> 1353 
<212> DNA 
<213> E . Coli 



<400 
atgactaaac 
aaccgcgcgg 
acctgcgtaa 
gaagcgatcc 
tgggaaacgt 
aacgtgctcg 
aaaacgctgg 
ggtcgtccga 
ttcgcccttc 
gagctggcgg 
gcgccgctgc 
gaaggcccgc 
agcctgacgc 
attggtcgcg 
gaaaaaggct 
gtgggcgata 
ctctctgaac 
accgtggtct 
gagcagtatg 
gccgtcacca 
aagattgtcg 
gtggcgctga 
acggcggcag 



> 238 
actatgatt a 
ctatgtacgg 
atgttggctg 
atatgtacgg 
tgatcgccag 
gtaaaaataa 
aggtaaacgg 
gccacccgga 
ctgctttgcc 
gcgtgattaa 
gcagcttcga 
agctgcacac 
tggagctgga 
agcctgccaa 
atatcgtcgt 
acacgggtgc 
gcctgtttaa 
tcagccatcc 
gcgacgatca 
ct caccgcca 
gtattcacgg 
agatgggggc 
aagagttcgt 



catcgccatc 
ccagaaatgt 
tgtgccgaaa 
cccggattat 
ccgtaccgcc 
cgttgatgta 
cgaaaccat c 
t at tccgggc 
agagcgcgtg 
cggcctcggc 
cccgatgatt 
caacgccat c 
agatggtcgc 
tgacaacatc 
cgataaatat 
agtggagctg 
taacaagccg 
gccgattggt 
ggtgaaagtg 
gccgtgccgc 
cattggcttt 
aaccaaaaaa 
gacaatgcgt 



ggcggcggca 
gcgctgattg 
aaagtgatgt 
ggttttgata 
tatatcgacc 
atcaaaggct 
acggccgatc 
gtggaatacg 
gcggttgttg 
gcgaaaacgc 
t ccgaaacgc 
ccgaaagcgg 
agtgaaacgg 
aacctggaag 
caaaacacca 
acaccggtgg 
gatgagcatc 
actgttggtt 
tataaatcct 
atgaagctgg 
ggtatggacg 
gacttcgaca 
taa 



gcggcggtat 
aagccaaaga 
ggcacgcggc 
ccactatcaa 
gtattcatac 
ttgcccgctt 
atattctgat 
gtattgattc 
gcgcgggtta 
atctgtttgt 
tggtcgaagt 
tagtgaaaaa 
tggattgcct 
ccgctggcgt 
atattgaagg 
cagttgcagc 
tggattacag 
t aacggaacc 
ctttcaccgc 
tgtgcgttgg 
aaatgtt gca 
ataccgtcgc 



cgcct ccatc 
gctgggcggc 
gcaaatccgt 
taaattcaac 
tt cct atgaa 
cgttgatgcc 
cgccacaggc 
tgatggcttc 
catcgccgtt 
gcgtaaacat 
gatgaacgcc 
taccgatggt 
gatttgggcg 
taaaactaac 
t atttacgcg 
gggtcgccgt 
caacattccg 
gcaggcgcgc 
gatgtatacc 
atcggaagag 
gggcttcgcg 
cattcaccca 



1800 
1860 
1920 
1929 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1353 



<210> 239 
<211> 2904 
<212> DNA 
<213> E . Coli 



<400 
aaggttaagc 
cccggcctat 
aactcatctc 
agctaccggg 
cctctcgtac 
actgtctcac 
ccttgggacc 
cgtcgatatg 
gagcgatggc 
gcgccgtcac 
ccaggattag 
aactacccac 
ttaaagggtg 
ccacctatcc 
ggtctttccg 
ctcgggtgga 
aaggaatttc 
caagagcttc 
ccgtatacgt 
gctggtatct 



> 239 
ctcacggttc 
caacgtcgtc 
ggggcaagtt 
cagtgccatt 
taggagcagc 
gacgttctaa 
tacttcagcc 
aactcttggg 
ccttccattc 
gctcgcagtc 
ccaaccttcg 
cagacactgt 
gtatttcaag 
t acacatcaa 
tcttgccgcg 
gacagcctgg 
gctaccttag 
gcttgcgcta 
ccactttcgt 
tcgactgatt 



attagtaccg 
gt cttcaacg 
tcgtgcttag 
ggcatgacaa 
ccccct cagt 
acccagctcg 
ccaggatgtg 
cggtatcagc 
agaaccaccg 
aagctggctt 
tgctcctccg 
ccgcaacccg 
gt cggctcca 
ggctcaatgt 
ggtacactgc 
ccat catt ac 
gaccgttata 
accccatcaa 
gtttgcacag 
tcagctccat 



gttagctcaa 
ttccttcagg 
atgctttcag 
cccgaacacc 
tctccagcgc 
cgtaccactt 
atgagccgac 
ctgttatccc 
gat cactatg 
atgccattgc 
ttactcttta 
gattacgggt 
tgcagactgg 
tcagtgtcaa 
atcttcacag 
gecattcgtg 
gttacggccg 
ttaaccttcc 
tgctgtgttt 
ccgcgaggga 



cgcatcgctg 
accct t aaag 
cacttatctc 
agtgatgcgt 
ccacggcaga 
taaatggcga 
atcgaggtgc 
cggagtacct 
acctgctttc 
actaacctcc 
ggaggagacc 
caacgttaga 
cgtccacact 
gctatagtaa 
cgagttcaat 
caggtcggaa 
ccgtttaccg 
ggcaccgggc 
tt aat aaaca 
cctcacctac 



cg'cttacaca 
ggtcagggag 
ttccgcattt 
ccactccggt 
tagggaccga 
acagccatac 
caaacaccgc 
tttatccgtt 
gcacctgctc 
tgatgtccga 
gccccagt ca 
acatcaaaca 
t caaagcctc 
aggttcacgg 
ttcact gagt 
ctt acccgac 
gggcttcgat 
aggcgtcaca 
gttgcagcca 
atatcagcgt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
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gccttctccc gaag^racgg caccattttg cctagttcct 

gccttggtat tctctacctg accacctgtg tcggtttggg 

atgcttagag gcttt-tcctg gaagcagggc atttgttgct 

catcacgcct cagccttgat tttccggatt tgcctggaaa 

cgggacaacc gtcgcccggc caacatagcc ttctccgtcc 

gtacaggaat attaacctgt ttcccatcga ctacgccttt 

actcaccctg ccccgattaa cgttggacag gaacccttgg 

tcacccgctt tatcgttact tatgtcagca ttcgcacttc 

acagcacacc ttcgcaggct tacagaacgc tcccctaccc 

gccgcagctt cggtgcatgg tttagccccg ttacatcttc 

gtgagctatt acgctt tctt" taaatgatgg ctgcttctaa 

ggccttccca catcgtttcc cacttaacca tgactttggg 

ttgtttccct cttcacgacg gacgttagca cccgccgtgt 

ccggtattcg cagtttgcat cgggttggta agtcgggatg 

ctctaccccc ggagatgaat tcacgaggcg ctacctaaat 

tatctcccgg tttgattggc ctttcacccc cagccacaag 

attagtcggt tcggtcctcc agttagtgtt .acccaacctt 

caccgggttt cgggtctata ccctgcaact taacgcccag 

ggctccccta ttcggttaac cttgctacag aatataagtc 

gtacgcagtc acacgcctaa gcgtgctccc actgcttgta 

ttttcactcc cctcgccggg gttcttttcg cctttccctc 

gtcagtcagg agtatttagc cttggaggat ggtcccccca 

tgtcccgccc tactcatcga gctcacagca tgtgcatttt 

tgtatcgcgc gcctttccag acgcttccac taacacacac 

gctccccgtt cgctcgccgc tactggggga atctcggttg 

ttagatgttt cagttccccc ggttcgcctc attaacctat 

tgtcgaaaca cactgggttt ccccattcgg aaatcgccgg 

ttaccgacgc ttatcgcaga ttagcacgtc cttcatcgcc 
ccgtgtacgc ttagtcgctt aacc 



tcacccgagt 
gtacgatttg 
tcagcaccgt 
accagcctac 
ccccttcgca 
cggcctcgcc 
tctt ccggcg 
tgatacctcc 
aacaacgcat 
cgcgcaggcc 
gccaacatcc 
accttagctg 
gtctcccgtg 
acccccttgc 
agctttcggg 
tcatccgct a 
caacctgccc 
ttaagactcg 
gctqacccat 
cgtacacggt 
acggtactgg 
tatt cagaca 
tgtgtacggg 
actgattcag 
atttcttttc 
ggattcagtt 
ttat aacggt 
tctgactgcc 



tctctcaagc 
atgttacctg 
agtgcctcgt 
acgctt aaac 
gtaacaccaa 
ttaggggtcg 
agcgggcttt 
agcatgcctc 
aagcgt cgct 
gactcgacca 
tggctgtctg 
gcggtctggg 
ataacattct 
cgaaacagtg 
gagaa'ccagc 
attttt caac 
atggctagat 
gtttcccttc 
tatacaaaag 
ttcaggttct 
t t cact atcg 
ggataccacg 
gctgtcaccc 
gctctgggct 
ctcggggtac 
aatgatagtg 
t catatcacc 
agggcatcca 



1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2904 



<210> 240. 
<211> 120 , 
<212> DNA 
<213> E . Coli 

<400> 240 

atgcctggca gttccctact ctcgcatggg gagaccccac actaccatcg gcgctacggc 60 
gtttcacttc tgagttcggc atggggtcag gtgggaccac cgcgctacgg ccgccaggca 120 

<210> 241 
<211> 76 
<212> DNA 
<213> E. Coli 



<400> 241 

gtccccttcg tctagaggcc caggacaccg ccctttcacg gcggtaacag gggttcgaat 60 
cccctagggg acgcca - 75 



<210> 242 
<211> 1549 
<212> DNA 
<213> E. Coli 



<400> 242 

aaattgaaga gtttgatcat ggctcagatt gaacgctggc ggcaggccta acacatgcaa 60 

gtcgaacggt aacaggaagc agcttgctgc ttcgctgacg agtggcggac gggtgagtaa 120 

tgtctgggaa gctgcctgat ggagggggat aactactgga aacggtagct aataccgcat 180 

aatgtcgcaa gaccaaagag ggggaccttc gggcctcttg ccatcggatg tgcccagatg 240 

ggattagctt gttggtgggg taacggctca ccaaggcgac gatccctagc tggtctgaga 300 

ggatgaccag ccacactgga actgagacac ggtccagact cctacgggag gcagcagtgg 360 

ggaatattgc acaatgggcg caagcctgat gcagccatgc cgcgtgtatg aagaaggcct 420 
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tcgggttgta .aagtactttc agcggggagg aagggagtaa agttaatacc tttgctcatt 480 

gacgttaccc gcagaagaag caccggctaa ctccgtgcca gcagccgcgg taatacggag 540 

ggtgcaagcg ttaatcggaa ttactgggcg taaagcgcac gcaggcgggt tggttaagtc 600 

agatgtgaaa tccccgggct caacctggga actgcatctg atactggcaa gcttgagtct ■ -660 

cgtagagggg ggtagaattc caggtgtagc ggtgaaatgc gtagagatct ggaggaatac. 720 

cggtggcgaa ggcggccccc tggacgaaga ctgacgctca ggtgcgaaag cgtggggagc 780 

aaacaggatt agataccctg gtagtccacg ccgtaaacga tgtcgacttg gaggttgtgc 840 

ccttgaggcg tggcttccgg agctaacgcg ttaagtcgac cgcctgggga gtacggccgc 900 

aaggttaaaa ctcaaatgaa ttgacggggg cccgcacaag cggtggagca tgtggtttaa 960 

ttcgatgcaa cgcgaagaac cttacctggt cttgacatcc acggaagttt tcagagatga 1020 

gaatgtgcct tcgggaaccg tgagacaggt gctgcatggc tgtcgtcagc tcgtgttgtg 1080 

aaatgttggg ttaagtcccg caacgagcgc aacccttatc ctttgttgcc agcggtccgg 1140 

ccgggaactc aaaggagact gccagtgata aactggagga aggtggggat gacgtcaagt 1200 

catcatggcc cttacgacca gggctacaca cgtgctacaa tggcgcatac aaagagaagc 1260 

gacctcgcga gagcaagcgg acctcataaa gtgcgtcgta gtccggattg gagtctgcaa 1320 

ctcgactcca tgaagtcgga atcgctagta atcgtggatc agaatgccac ggtgaatacg 1380 

ttcccgggcc ttgtacacac cgcccgtcac accatgggag tgggttgcaa aagaagtagg 1440 

tagcttaacc ttcgggaggg cgcttaccac tttgtgattc atgactgggg tgaagtcgta 1500 

acaaggtaac cgraggggaa cctgcggttg gatcacctcc ttaccttaa 154 9 



<210> 243 

<211> 221 

<212> PRT 

<213> E. Coli 
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210 215 220 



<210> 244 
<211> 203 
<212> PRT 
<213> E. Coli 
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Arg Thr Phe Asn Asp His Val Trp Ala Asn Asn Lys Asp Asn Tyr Arg 

4 50 4 55 4 60 

Arg Asp Glu Asn Asp Val Tyr Asp He Ala Asp Tyr Tyr Gin Asn Asp 
465 470 475 480 

Phe Gly Arg Lys Asn Ser Phe Ser Ala Asn Met Ser Gin Ser Leu Pro 

485 490 495 

Glu Gly Trp Gly Ser Val Ser Leu Ser Thr Leu Trp Arg Asp Tyr Trp 

500 505 510 

Gly Arg Ser Gl.y Ser Ser Lys Asp Tyr Gin Leu Ser Tyr Ser Asn Asn 

515 520 525 

Leu Arg Arg He Ser Tyr Thr Leu Ala Ala Ser Gin Ala Tyr Asp Glu 

530 535 . 540 

Asn His His Glu Glu Lys Arg Phe Asn He Phe He Ser lie Pro Phe 
545 550 555 560 

Asp Trp Gly Asp Asp Val Ser Thr Pro Arg Arg Gin He Tyr Met Ser 

565 570 575 

Asn Ser Thr Thr Phe Asp Asp Gin Gly Phe Ala Ser Asn Asn Thr Gly 

580 585 590 

Leu Ser Gly Thr Val Gly Ser Arg Asp Gin Phe Asn Tyr Gly Val Asn 

595 600 605 

Leu Ser His Gin His Gin Gly Asn Glu Thr Thr Ala Gly Ala Asn Leu 

610 615 620 

Thr Trp Asn Ala Pro Val Ala Thr Val Asn Gly Ser Tyr Ser Gin Ser 
625 630 635 640 

Ser Thr Tyr Arg Gin Ala Gly Ala Ser Val Ser Gly Gly He Val Ala 

645 650 655 

Trp Ser Gly Gly Val Asn Leu Ala Asn Arg Leu Ser Glu Thr Phe Ala 

660 665 670 

Val Met Asn Ala Pro Gly He Lys Asp Ala Tyr Val Asn Gly Gin Lys 

675 680 " 685 

Tyr Arg Thr Thr Asn Arg Asn Gly Val Val He Tyr Asp Gly Met Thr 

690 695 700 

Pro Tyr Arg Glu Asn His Leu Met Leu Asp Val Ser Gin Ser Asp Ser 
705 710 715 720 

Glu Ala Glu Leu Arg Gly Asn Arg Lys He Ala Ala Pro Tyr Arg Gly 

725 730 735 

Ala Val Val Leu Val Asn Phe Asp Thr Asp Gin Arg Lys Pro Trp Phe 

740 745 ~ 750 

He Lys Ala Leu Arg Ala Asp Gly Gin Ser Leu Thr Phe Gly Tyr Glu 

"755 760 765 

Val Asn Asp He His Gly His Asn lie Gly Val Val Gly Gin Gly Ser 

770 775 780 

Gin Leu Phe lie Arg Thr Asn Glu Val Pro Pro Ser Val Asn Val Ala 
? 85 790 795 800 

lie Asp Lys Gin Gin Gly Leu Ser Cys Thr He Thr Phe Gly Lys Glu 

805 810 815 

lie Asp Glu Ser Arg Asn Tyr lie Cys Gin 
820 825 
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Met Ala Ala lie Pro Trp Arg Pro Phe Asn Leu Arg Gly lie Lys Met 

1 5 10 15 

Lys Gly Leu Leu Ser Leu Leu He Phe Ser Met Val Leu Pro Ala His 

20 25 30 

Ala Gly He Val He Tyr Gly Thr Arg lie lie Tyr Pro Ala Glu Asn 
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35 4 0 4 5 

Lys Glu Vai Met Vai Gin Leu Met Asn Gin Gly Asn Arg Ser Ser Leu 

50 55 60 

Leu Gin Ala Trp lie Asp Asp Gly Asp Thr Ser Leu Pro Pro Glu Lys 
65 70 75 .80 

lie Gin Vai Pro Phe Met Leu Thr Pro Pro Vai Ala Lys lie Gly Ala 

85 90 95 

Asn Ser Gly Gin Gin Vai Lys lie Lys lie Met Pro Asn Lys Leu Pro 

100 105 HO 

Thr Asn Lys Glu Ser lie Phe Tyr Leu Asn Vai Leu Asp lie Pro Pro 

115 120 .125 

Asn Ser Pro Glu Gin Glu Gly Lys Asn Ala Leu Lys Phe Ala Met Gin 

130 135 140 

Asn Arg lie Lys Leu Phe Tyr Arg Pro Ala Gly lie Ala Pro Vai Asn 
.145 150 • 155 160 

Lys Ala Thr Phe Lys Lys Leu Leu Vai Asn Arg Ser Gly Asn Gly Leu 

165 170 " 175 

Vai lie Lys Asn Asp Ser Ala Asn Trp Vai Thr lie Ser Asp Vai Lys 

180 185 190 

Ala Asn Asn- Vai Lys Vai Asn Tyr Glu Thr lie Met lie Ala Pro Leu 

195 200 205 

Glu Ser Gin Ser Vai Asn Vai Lys Ser Asn Asn Ala Asn Asn Trp His 

210 215 220 

Leu Thr lie lie Asp Asp His Gly Asn Tyr lie Ser Asp Lys lie 
225 230 235 ' 
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Met Lys Arg Ser lie lie Ala Ala Ala Vai Phe Ser Ser Phe Phe Met 

1 5 10 15 

Ser Ala Gly Vai Phe' Ala Ala Asp Vai Asp Thr Gly Thr Leu Thr lie 

20 25 30 

Lys Gly Asn lie Ala Glu Ser Pro Cys Lys Phe Glu Ala Gly Gly Asp 

35 4 0 4 5 

Ser Vai Ser lie Asn Met Pro Thr Vai Pro Thr' Ser Vai Phe Glu Gly 

50 55-60 
Lys Ala Lys Tyr Ser Thr Tyr Asp- Asp Ala Vai Gly Vai Thr Ser Ser 
65 70 75 80 

Met Leu Lys lie Ser Cys Pro Lys Glu Vai Ala Gly Vai Lys Leu Ser 

8 5 90 ~ ■ " 95 . 

Leu lie Thr Asn Asp Lys lie Thr Gly Asn Asp Lys Ala lie Ala Ser 

100 105 " 110. 

Ser Asn Asp Thr Vai Gly Tyr Tyr Leu Tyr Leu Gly Asp Asn Ser Asp 

115 120 " • * 125 

Vai Leu Asp Vai Ser Ala Pro Phe Asn lie Glu Ser Tyr Lys Thr Ala 

130 135 140 

Glu Gly Gin Tyr Ala lie Pro Phe Lys Ala Lys Tyr Leu Lys Leu Thr 
145 150 .155 160 

Asp Asn Ser Vai Gin Ser Gly Asp Vai Leu Ser Ser Leu Vai Met Arg 
165 170 175 

Vai Ala Gin Asp 
180 
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1 ' 5 

Asp Val Ala Lys Ala Val lie Ala Gly 

20 . 25 
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35 ~ 40 

Val Gly Gly Ala lie Leu Leu Thr Gly 

50 • 55 

Asn Glu lie Asp Ala. Gin Cys His Leu 
65 70 
lie Arg Asp Gly Leu Lys Arg Gin Gin 
85 

Glu Asn Met Thr Pro Phe Met Tyr Val 
100 105 
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Leu 


Thr 


Gly 


Leu 


Pro 


Gin 


Ser 


Gly 


Gin 


Leu 


Gin 


Val 


Ser 


Trp 


Gly 








340 










345 










350 




Lys 


Asp 


Lys 


Asn 


Ser 


Asn 


Cys 


lie 


Val 


Glu 


Tyr 


Lys 


Leu 


Pro 


Glu 


Val 






355 










3^60 










365 








Ser 


Pro 


Gly 


Thr 


Leu 


Leu 


Asn 


Gin 


Gin 


Thr 


Ala 


He 


Cys. 


Arg 








370 










375 










380 









<210> 300 
<211> 138 
<212> PRT 
<213> E . Coli 

<400> 300 
Met He Ala He Ala Asp He Leu 

1 5 
Val Ala Pro Phe Leu Ala Gly He 
20 

Ala Leu Glu Leu Val Asp His Leu 

35 * 40 

Leu Leu Asp Leu Val Cys Ala Lys 

50 55 • 

Pro Glu Phe Ala Glu Phe Asn Ala 
65 70 
lie Ala Val lie Arg Thr Leu Met 
85 

Asp Leu Pro Glu lie Gly Ser Lys 



Gin Ala Gly Glu Lys Leu Thr Ala 

10 15 

Gin Asn Glu Glu Gin Tyr Thr Gin 

25 30 

Leu Leu Asn Asp Pro Glu Asn Pro 
45 

lie Thr Ala Trp Glu Glu Ser Ala 
60 

Met Ala Gin Ala Met Pro Gly Gly 

75 ' 80 

Asp Gin Tyr Gly Leu Thr Leu Ser 

90 95 

Ser Met Val Ser Arg Val Leu Ser 
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100 105 HO 

Gly Lys Arg Lys Leu Thr Leu Glu His Ala Lys Lys Leu Ala Thr Arg 

115 120 125 

Phe Gly lie Ser Pro Ala Leu Phe lie Asp 
130 135 



<210> 301 

<211> 104 

<212> PRT 

<213> E. Coli 

<400> 301 

Met His Leu lie Thr Gin Lys Ala Leu Lys Asp Ala Ala Glu Lys Tyr 

1 5 10 15 

Pro Gin His Lys Thr Glu Leu Val Ala Leu Gly Asn Thr lie Ala Lys 

20 25 30 

Gly Tyr . Phe Lys Lys Pro Glu Ser Leu Lys Ala Val. Phe Pro Ser Leu 

35 40 45 

Asp Asn Phe Lys Tyr Leu Asp Lys His Tyr Val Phe Asn Val Gly Gly 

50 55 60 

Asn Glu Leu Arg Val Val Ala Met Val Phe Phe Glu Ser Gin Lys Cys 
65 70 .75 80 

Tyr lie Arg Glu Val Met Thr His Lys Glu Tyr Asp Phe Phe Thr Ala 

85 90 . 95 

Val His Arg Thr Lys Gly Lys Lys 
100 



<210> 302 
<211> 2383 , ■ 
<212> PRT 
<213> E. Coli 

<400> 302 

Met Leu Ser Val Phe Thr Phe Phe Arg Cys Ala Arg Lys Gly Ala Phe 

15 10 15 

Met Leu Ala Arg Ser Gly Lys Val Ser Met Ala Thr Lys Lys Arg Ser 

20 25 30 

Gly Glu Glu lie Asn Asp Arg Gin He Leu Cys Gly Met Gly He Lys 

35 4 0 4 5 

Leu Arg Arg Leu Thr Ala Gly lie Cys Leu lie Thr Gin Leu Ala Phe 

50 55 .60 

Pro Met Ala Ala Ala Ala Gin Gly Val Val Asn Ala Ala Thr Gin Gin 
65 70 75 80 

Pro Val Pro Ala Gin He Ala He Ala Asn Ala Asn Thr Val Pro Tyr 

85 90 95 

Thr Leu Gly Ala Leu Glu Ser Ala Gin Ser Val Ala Glu Arg Phe Gly 

100 105 HO 

He Ser Val Ala Glu Leu Arg Lys Leu Asn Gin Phe Arg Thr Phe Ala 

115 120 125 

Arg Ser Phe Asp Asn Val Arg Gin Gly Asp Glu Leu Asp Val Pro Ala 

130 135 140 

Gin Val Ser Glu Lys Lys Leu Thr Pro Pro Pro Gly Asn Ser Ser Asp 
■145 150 155 ' 160 

Asn Leu Glu Gin Gin He Ala Ser Thr Ser Gin Gin He Gly Ser Leu 

165 170 175 

Leu Ala Glu Asp Met Asn Ser Glu Gin Ala Ala Asn Met Ala Arg Gly 

180 185 190 

Trp Ala Ser Ser Gin Ala Ser Gly Ala Met Thr Asp Trp Leu Ser Arg 
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195 



Phe 


Gly 


Thr 


Ala 


Arg 


lie 


Thr 




210 










215 


Lys 


Asn 


Ser 


Gin 


Phe 


Asp 


Phe 


225 










230 




Asn 


Leu 


Phe 


Phe 


Ser 


Gin 


His 










24 5 






Gin 


He 


Asn 


Asn 


Gly 


Leu 


Gly 








o r\ 
2 60 








Ser 


Gly 


lie 


Asn 


Phe 


Phe 


Phe 






O ^ c 
Z ID 










Arg 


Ala 


Gly 


He 


Gly 


Ala 


Glu 




n n a 

2 90 










one 

2 95 


Ser 


Asn 


Gly 


Tyr 


Leu 


Arg 


Leu 


"3 A C 

305 










310 




Asp 


Asn 


Asp 


Tyr 


Glu 


Ala 


Arg 










325 






Glu 


Ser 


Trp 


Leu 


Pro 


Ala 


Trp 








340 








Glu 


Gin 


Tyr 


Tyr 


Gly 


Asp 


Glu 






355 










Gin 


Ser 


Asn 


Pro 


His 


Ala 


He 




37 0 










37 5 


Pro 


Leu 


Met 


Thr 


Phe 


Ser 


Ala 


o o c 

385 










3 90 




Asn 


Asp 


Thr 


Arg 


Phe 


Ala 


Val 










405 






Met 


Gin 


Lys 


Gin 


Leu 


Asp 


Pro 








420 








Ala 


Gly 


Ser 


Arg 


Tyr 


Asp 


Leu 






4 35 










Glu 


Tyr 


Arg 


Lys 


Lys ,Glu 


Leu 




4 50 










4 55 


Thr 


Gly 


Lys 


Ser 


Gly 


Glu 


Val 


4 65 










470 




Lys 


Tyr 


Ala 


Leu 


Lys 


Gly 


Tyr 










485 






Ala 


Gly 


Gly 


Lys 


Val 


Val 


Thr 








500 








Pro 


Ala 


Tyr 


Arg 


Phe 


Thr 


Ser. 






515 










He 


Glu 


Val 


Thr 


Ala 


Glu 


Asp 




530 










535 


Gin 


Ser 


Met 


Val 


Val 


Val 


Gin 


545 










550 




Ser 


Val 


Ser 


Leu 


Ser 


Thr 


Gin 










565 






Ala 


Thr 


Leu 


Thr 


Phe 


He 


Ala 








580 








Gly 


Leu 


Val 


Leu 


Ser 


Thr 


Arg 






595 










Ser 


Asp 


Trp 


Lys 


Asp 


Asn 


Gly 




610 










615 


Thr 


Gly 


Ala 


Met 


Ser 


Gly 


Thr 


roc 










630 




Val 


Asp 


Ala 


Ala 


Lys 


Ala 


Pro 










645 






•Ser 


Ser 


Arg 


Thr 


His 


Ser 


Ser 








660 








Ser 


Gly 


Asn 


Pro 


He 


Glu 


Val 



675 




2 00 










205 




Leu 


Gl y 


Val 


Asp 


Glu 


Asp 


Phe 










220 






Leu 


His 


Pro 


Trp 


Tyr 


Glu 


Thr 








235 








Thr 


Leu 


His 


Arg 


Thr 


Asp 


Glu 






250 










Trp 


Arg 


His 


Phe 


Thr 


Pro 


Thr 




265 










270 


Asp 


His 


Asp 


Leu 


Ser 


Arg 


Tyr 


2 80 










285 




Tyr 


Trp 


Arg 


Asp 


Tyr 


Leu 


Lys 










300 






Thr 


Asn 


Trp 


Arg 


Ser 


Ala 


Pro 








315 








Pro 


Ala 


Asn 


Gly 


Trp 


Asp 


Val 






330 










Pro 


His 


Leu 


Gly 


Gly 


Lys 


Leu 




345 










350 


Val 


Ala 


Leu 


Phe 


Asp 


Lys 


Asp 


360 










365 




Thr 


Ala 


Gly 


Leu 


Asn 


Tyr 


Thr 










380 






Glu 


Gin 


Arg 


Gin 


Gly 


Lys 


Gin 








395 








Asp 


Phe 


Thr 


Trp 


Gin 


Pro 


Gly 






410 










Asn 


Glu 


Val 


Ala 


Ala 


Arg 


Arg 




425 










430 


Val 


Asp 


Arg 


Asn 


Asn 


Asn 


He 


4 40 










445 




Val 


Arg 


Leu 


Thr 


Leu 


Thr 


Asp 










4 60 




* 


Lys 


Ser 


Leu 


Val 


Ser 


Ser 


Leu 








475 








Asn 


Val 


Glu 


Ala 


Thr 


Ala 


Leu 






4 90 










Thr 


Gly 


Lys 


Asp 


He 


Leu 


Val 




505 










510 


Thr 


Pro 


Glu 


Thr 


Asp 


Asn 


Thr 


520 










525 




Val 


Lys 


Gly 


Asn 


Leu 


Ser 


Asn 










540 






Ala 


Pro 


Thr 


Leu 


Ser 


Gin 


Lys 








555 








Thr 


Leu 


Asn 


Ala 


Asp 


Ser 


His 






570 










His 


Asp 


Ala 


Ala 


Gly 


Asn 


Pro 




585 










590 


His 


Glu 


Gly 


Val 


Gin 


Asp 


He 


600 










605 




Asp 


Gly 


Ser 


Tyr 


Thr 


Gin 


He 










620 






Leu 


Thr 


Leu 


Met 


Pro 


Gin 


Leu 








635 








Ala 


Val 


Val 


Asn 


He 


He 


Ser 






650 










He 


Lys 


He 


Asp 


Lys. 


Asp 


Arg 




665 










670 


Thr 


Val 


Glu 


Leu 


Arg 


Asp 


Glu 


680 










685 
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Ser Leu 

Pro Asp 
240 
Arg • Thr 
255 

Trp Met 

His Ser 

Leu Ser 

Glu Leu 
320 
Arg Ala 
335 

Val Tyr 

Asp Arg 

Pro Phe 

Gly Glu 
400 
Ser Ala 
415 

Ser Leu 

Val Leu 

Pro Val 

Gin Thr 
480 
Glu Ala 
495 

Thr Leu 

Trp Pro 

Arg Glu 

Asp Ser 
560 
Ser Thr 
575 

Val Val 

Thr Leu 

Leu Thr 

Asn Gly 
640 
Val Ser 
655 

Tyr Leu 
Asn Asp 
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Lvs 


Pro 


Val 


Lvs 

y 


Glu 


Gin 


Lvs 

y 


Gin 


Gin 


Leu 


Asn 


As n 


Ala 


Val 


Ser 


He 




690 










695 










700 










Asp 


Asn 


Val 


Lvs 


Pro 


Glv 


Val 


Thr 


Thr 


Asp 


Trn 


Lys 


Glu 


Thr 


Ala 


Asp 


705 










710 










715 










7 9 ri 


Glv 


Val 


Tvr 


Lvs 


Ala 


Thr 


Tvr 


Thr 


Ala 


Tvr 


Thr 


Lys 


Gly 


Ser 


Gly 


Leu 










725 










730 










7 35 




Thr 


Ala 


Lvs 


Leu 


Leu 


Met 


Gin 


Asn 


Trn 


Asn 


Glu 


Asp 


Leu 


His 


Thr 


Ala 








740 










74 5 










750 






Glv 


Phe 


lie 


He 


Asd 


Ala 


Asn 


Pro 


Gin 


Ser 


Ala 


Lys 


He 


Ala 


Thr 


Leu 






755 










760 










7 65 








Ser 


Ala 


Ser 


Asn 


Asn 


Glv 


Val 


Leu 


Ala 


Asn 


Glu 


Asn 


Ala 


Ala 


Asn 


Thr 




770 










775 










780 










Val 


Ser 


Val 


Asn 


Val 


Ala 


Asp 


Glu 


Glv 


Ser 


Asn 


Pro 


He 


Asn 


Asp 


His 


785 










790 










795 










800 


Thr 


Val 


Thr 


Phe 


Ala 


Val 


Leu 


Ser 


Glv 


Ser 


Ala 


Thr 


Ser 


Phe 


Asn 


Asn 










805 










810 










815 




Gin 


As n 


Thr 


Ala 


Lvs 


Thr 


Asp 


Val 


As n 


Glv 


Leu 


Ala 


Thr 


Phe 


Asp 


Leu 








820 










825 










830 






Lys 


Ser 


Ser 


Lys 


Gin 


Glu 


Asp 


Asn 


Thr 


Val 


Glu 


Val 


Thr 


Leu 


Glu 


As n 






835 










840 










845 








Glv 


Val 


Lys 


Gin 


Thr 


Leu 


He 


Val 


Ser 


Phe 


Val 


Gly 


Asp 


Ser 


Ser 


Thr 




850 










855 










860 










Ala 


Gin 


Val 


Asp 


Leu 


Gin 


Lys 


Ser 


Lys 


As n 


Glu 


Val 


Val 


Ala 


Asp 


Gly 


8 65 










8 7 0- 










875 










88 0 


Asn 


Asp 


Ser 


Val 


Thr 


Met 


Thr 


Ala 


Thr 


Val 


Arg 


Asp 


Ala 


Lys 


Gly 


As n 










885 










890 










895 




Le u 


Leu 


As n 


Asp 


Val 


Met 


Val 


Thr 


Phe • 


As n 


Val 


A c n 


iJCl 


Al A 


C\ n 
oxu 


A 1 a 








900 










905 










910 






Lys 


Le u 


Ser 


Gin 


Thr 


Glu 


Val 


Asn 


Ser 


His 




CZ'\ v 
oiy 


Tip 


A 1 A 


Th t~ 

L 11 L 


AT a 






915 










920 










925 








Thr 


Leu 


Thr 


Ser 


Leu 


Lys 


Asn 


Glv 


Asp 


Tvr 


Arg 


Val 


Thr 


Ala 


Ser 


Val 




930 










935 










94 0 










Ser 


Ser 


Glv 


Ser 


Gin 


Ala 


Asn 


Gin 


Gin 


Val 


Asn 


Phe 


He 


Glv 


Asp 


Gin 


945 










950 










955 










960 


Ser 


Thr 


Ala 


Ala 


Leu 


Thr 


Leu 


Ser 


Val 


Pro 


Ser 


Glv 


Asp 


He 


Thr 


Val 










965 










970 










975 




Thr 


Asn 


Thr 


Ala 


Pro 


Gin 


Tyr 


Met 


Thr 


Ala 


Thr 


Leu 


Gin 


Asp 


Lys 


As n 








980 










985 










990 






Gly 


Asn 


Pro 


Leu 


Lys 


Asp 


Lys 


Glu 


He 


Thr 


Phe 


Ser 


Val 


Pro 


As n 


As p 






995 










1000 








1005 






Val 


Ala 


Ser 


Lys 


Phe 


Ser 


He 


Ser 


Asn 


Gly 


Gly 


Lys 


Gly 


Met 


Thr 


Asp 




1010 








1015 








1020 








Ser 


Asn 


Gly 


Val 


Ala 


He 


Ala 


Ser 


Leu 


Thr 


Gly 


Thr 


Leu 


Ala 


Gly 


Thr 


1025 








1030 








1035 








1041 


His 


Met 


He 


Met 


Ala 


Arg 


Leu 


Ala 


Asn 


Ser 


Asn 


Val 


Ser 


Asp 


Ala 


Gin 










1045 








1050 








1055 


Pro 


Met 


Thr 


Phe 


Val 


Ala 


Asp 


Lys 


Asp 


Arg 


Ala 


Val 


Val 


Val 


Leu 


Gin 








1060 








1065 








1070 




Thr 


Ser 


Lys 


Ala 


Glu 


He 


He 


Gly 


Asn 


Gly 


Val 


Asp 


Glu 


Thr 


Thr 


Leu 






1075 








1080 








1085 






Thr 


Ala 


Thr 


Val 


Lys 


Asp 


Pro 


Ser 


Asn 


His 


Pro 


Val 


Ala 


Gly 


lie 


Thr 




1090 








1095 








1100 








Val 


Asn 


Phe 


Thr 


Met 


Pro 


Gin 


Asp 


Val 


Ala 


Ala 


Asn 


Phe 


Thr 


Leu 


Glu 


1105 








1110 








1115 








112< 


Asn 


Asn 


Gly 


He 


Ala 


He 


Thr 


Gin 


Ala 


Asn 


Gly 


Glu 


Ala 


His 


Val 


Thr 










1125 








1130 








1135 


Leu 


Lys 


Gly 


Lys 


Lys 


Ala 


Gly 


Thr 


His 


Thr 


Val 


Thr 


Ala 


Thr 


Leu 


Gly 








1140 








1145 








1150 




Asn 


Asn 


Asn 


Thr 


Ser 


Asp 


Ser 


Gin 


Pro 


Val 


Thr 


Phe 


Val 


Ala 


Asp 


Lys 






1155 








1160 








1165 






Ala 


Ser 


Ala 


Gin 


Val 


Val 


Leu 


Gin 


He 


Ser 


Lys 


Asp 


Glu 


He 


Thr 


Gly 
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1170 1175 1180 

Asn Giy Val Asp Ser Ala Thr Leu Thr Ala Thr Val Lys Asp Gin Phe 
1185 1190 1195 1200 

Asp Asn Glu Val Asn Asn Leu Pro Val Thr Phe Ser Ser Ala Ser Ser 

1205 1210 1215 

Gly Leu Thr Leu Thr Pro Gly Val Ser Asn Thr Asn Glu Ser Gly lie 

1220 1225 1230 

Ala Gin Ala Thr Leu Ala Gly Val Ala Phe Gly Glu Lys Thr Val Thr 

123S 1240 1245 

Ala Ser Leu Ala Asn Asn Gly Ala Ser Asp Asn Lys Thr Val His Phe 

1250 1255 1260 

lie Gly Asp Thr Ala Ala Ala Lys lie lie Glu Leu Ala Pro Val Pro 
1265 . 1270 1275 1280 

Asp Ser He He Ala Gly Thr Pro Gin Asn Ser Ser Gly Ser Val He 

1285 1290 1295 

Thr Ala Thr Val Val Asp Asn Asn Gly Phe Pro Val Lys Gly Val Thr 

1300 1305 1310 

Val Asn Phe Thr Ser Asn Ala Ala Thr Ala Glu Met Thr Asn Gly Gly 

1315 1320 1325 

Gin Ala Val Thr Asn Glu Gin Gly Lys Ala Thr Val Thr Tyr Thr Asn 

1330. 1335 1340 

Thr Arg Ser Ser He Glu Ser Gly Ala Arg Pro Asp Thr Val Glu Ala 
1345 1350 1355 1360 

Ser Leu Glu Asn Gly Ser Ser Thr Leu Ser Thr Ser He Asn Val Asn 

1365 . 1370 1375 

Ala Asp Ala Ser Thr Ala His Leu Thr Leu Leu Gin Ala Leu Phe Asp 

1380 1385 1390 

Thr Val Ser Ala Gly Glu Thr Thr Ser Leu Tyr lie Glu Val Lys Asp 

1395 1400 1405 

Asn Tyr Gly Asn Gly Val Pro Gin Gin Glu Val Thr Leu Ser Val Ser 

1410 1415 ' 1.420 

Pro Ser Glu Gly Val , Thr Pro Ser Asn Asn Ala He Tyr Thr Thr Asn 
1425 : 1430 1435 1440 

His Asp Gly Asn Phe Tyr Ala Ser Phe Thr Ala Thr Lys Ala Gly Val 

1445 " 1450 .1455 

Tyr Gin Leu Thr Ala Thr Leu Glu Asn Gly Asp Ser Met Gin Gin Thr 

1460 1465 1470 

Val Thr Tyr Val Pro Asn Val Ala Asn Ala Glu lie Thr Leu Ala Ala 

1475 1480 1485 

Ser Lys Asp Pro Val He Ala Asp Asn Asn Asp Leu Thr Thr Leu Thr 

1490 1495 .1500 

Ala Thr Val Ala Asp Thr Glu Gly Asn Ala He Ala Asn Thr Glu Val 
1505 1510 1515 1520 

Thr Phe Thr Leu Pro Glu Asp Val Lys Ala Asn Phe Thr Leu Ser Asp 

1525 1530 1535 

Gly Gly Lys Val He Thr Asp Ala Glu Gly Lys Ala Lys Val Thr Leu 

1540 1545 1550 

Lys Gly Thr Lys Ala Gly Ala His Thr Val Thr Ala Ser Met Thr Gly 

1555 1560 1565 

Gly Lys Ser Glu Gin Leu Val Val Asn Phe lie Ala Asp Thr Leu Thr 

1570 1575 1580 

Ala Gin Val Asn Leu Asn Val Thr Glu Asp Asn Phe lie Ala Asn Asn 
"1585 1590 1595 1600 

Val Gly Met Thr Arg Leu Gin Ala Thr Val Thr Asp Gly Asn Gly Asn 

1605 1610 1615 

Pro Leu Ala Asn Glu Ala Val Thr Phe Thr Leu Pro Ala Asp Val Ser 

1620 1625 1630 

Ala Ser Phe Thr Leu Gly Gin Gly Gly Ser Ala lie Thr Asp lie Asn 

1635 1640 1645 

Gly Lys Ala Glu Val Thr Leu Ser Gly Thr Lys Ser Gly Thr Tyr Pro 
1650 1655 1660 
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Val Thr Val Ser Val Asn Asn Tyr Gly Val Ser Asp Thr Lys Gin Val 
1665 1670 1675 " 1680 

Thr Leu lie' Ala Asp Ala Gly Thr Ala Lys Leu Ala Ser Leu Thr Ser 
1685 1690 1695 

• Val Tyr Ser Phe Val Val Ser Thr Thr Glu Gly Ala Thr Met Thr Ala 

1700 1705 ' 1710 

Ser Val Thr Asp Ala Asn Gly Asn Pro Val Glu Gly lie Lys Val Asn 
1715 1720 1725 

' Phe Arg Gly Thr Ser Val Thr Leu Ser Ser Thr Ser Val Glu Thr Asp 
1730 1735 1740 

• Asp Arg Gly Phe Ala Glu lie Leu Val Thr Ser Thr Glu Val Gly Leu 
1745 1750 1755 1760 
Lys Thr Val. Ser Ala Ser Leu Ala Asp Lys Pro Thr Glu Val lie Ser 

1765 1770 1775 

Arg Leu Leu Asn Ala Ser Ala Asp Val Asn Ser Ala Thr lie Thr Ser 

1780 1785 .- 1790 

Leu Glu lie Pro Glu Gly Gin Val Met Val Ala Gin Asp Val Ala Val 

1795 * 1800 1805 

Lys Ala His Val Asn Asp Gin Phe Gly Asn Pro Val Ala His Gin Pro 

1810 1815 1820 

Val Thr Phe Ser Ala' Glu Pro Ser Ser Gin Met lie He Ser Gin Asn 
1825 1830 "1835 .1840 

Thr Val Ser Thr Asn Thr Gin Gly Val Ala Glu Val Thr Met Thr Pro 

1845 1850 1855 

Glu Arg Asn Gly Ser Tyr Met Val Lys Ala Ser Leu Pro Asn Gly Ala 

1860 1865 1870 

Ser Leu Glu Lys Gin Leu Glu Ala He Asp Glu Lys Leu Thr Leu Thr 

1875 1880 1885 

Ala Ser Ser Pro 'Leu lie Gly Val Tyr Ala Pro Thr Gly Ala Thr Leu 

18 90 18 95 1900 

Thr Ala Thr Leu Thr Ser Ala Asn Gly Thr. Pro Val Glu~Gly Gin Val 
1905 ,1910 1915 1920 

lie Asn Phe Ser Val Thr Pro Glu Gly Ala Thr Leu Ser Gly Gly Lys 

1925 "* 1930 1935 

Val Arg Thr Asn Ser Ser Gly Gin Ala Pro Val Val Leu Thr Ser Asn 

1940 1945 1950 

Lys Val Gly Thr Tyr Thr Val Thr Ala Ser Phe His Asn Gly Val Thr 

1955 1960 . 1965 

He Gin Thr Gin Thr Thr Val Lys Val Thr Gly Asn Ser Ser Thr Ala 

1970 1975 1980 

His Val Ala Ser Phe He Ala Asp Pro Ser Thr He Ala Ala Thr Asn 
1985 1990 1995 2000 

Thr Asp Leu Ser Thr Leu Lys Ala Thr Val Glu Asp Gly Ser Gly Asn 

2005 2010 " 2015 

Leu lie Glu Gly Leu Thr Val Tyr Phe Ala Leu Lys Ser Gly Ser Ala 

2020 2025 "* 2030 

Thr Leu Thr Ser Leu Thr Ala Val Thr Asp Gin Asn Gly He Ala Thr 

2035 2040 2045 

Thr Ser Val Lys Gly Ala Met Thr Gly Ser Val Thr Val Ser Ala Val 

2050 2055 2060 

Thr Thr Ala Gly Gly Met Gin Thr Val Asp He Thr Leu Val Ala Gly 
2065 2070 2075 2080 

Pro Ala Asp Thr Ser Gin Ser Val Leu Lys Ser Asn Arg Ser Ser Leu 

2085 2090 ~ 2095 - 

Lys Gly Asp Tyr Thr Asp Ser Ala Glu Leu Arg Leu Val Leu His Asp 

2100 . 2105 " 2110 

He Ser Gly Asn Pro He Lys Val Ser Glu Gly Met Glu Phe Val Gin 

2115 2120 2125 

Ser Gly Thr Asn Val Pro Tyr lie Lys lie Ser Ala lie Asp Tyr Ser 

2130 2135 2140 

Leu Asn lie Asn Gly Asp Tyr Lys Ala Thr Val Thr Gly Gly Gly Glu 
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2145 2150 2155 2160 

Gly lie Ala Thr Leu lie Pro Val Leu Asn Gly Val His Gin Ala Gly 

2165 2170 * 2175 

Leu Ser Thr Thr lie Gln.Phe Thr Arg Ala Glu Asp Lys lie Met Ser 

2180 2185 ^ 2190 

Gly Thr Val Ser Val Asn Gly Thr Asp Leu Pro Thr Thr Thr Phe Pro 

2195 2200 2205 

Ser Gin Gly Phe Thr Gly Ala Tyr Tyr Gin Leu Ash Asn Asp Asn Phe 

2210 2215 2220 

Ala Pro Gly Lys Thr Ala Ala Asp Tyr Glu Phe Ser Ser Ser Ala Ser 
2225 2230 2235 2240 

Trp Val Asp Val Asp Ala Thr Gly Lys Val Thr Phe Lys Asn Val Gly 

2245 2250 2255 

Ser Asn Ser Glu Arg lie Thr Ala Thr Pro Lys Ser Gly Gly Pro Ser 

2260 2265' . " 2270 

Tyr Val Tyr Glu lie Arg Val Lys Ser Trp Trp Val Asn Ala Gly Glu 

2275 2280 2285 

Ala Phe Met lie Tyr Ser Leu Ala Glu Asn Phe Cys Ser Ser Asn Gly 

2290 2295 2300 

Tyr Thr Leu Pro Arg Ala Asn Tyr Leu Asn His Cys Ser Ser Arg Gly 
2305 2310 2315 " 2320 

lie Gly Ser Leu Tyr Ser Glu Trp Gly Asp Met Gly His Tyr Thr Thr 

2325 2330 2335 
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Gly Gly He Arg Glu lie Glu Phe He Val Gin Val Phe Gin Leu lie 

325 330 335 

Arg Gly Gly Arg Glu Pro Ser Leu Gin Ser Arg Ser Leu Leu Pro Thr 

340 345 350 

Leu Ser Ala. He Ala Glu Leu His Leu Leu Ser Glu Asn Asp Ala Glu 

355 360 365 

Gin Leu Arg Val Ala Tyr Leu Phe Leu Arg Arg Leu Glu Asn Leu Leu 

370 375 " 380 

Gin Ser lie Asn Asp Glu Gin Thr . Gin Thr Leu Pro Ser Asp Glu Leu 
385 390 395 400 

Asn Arg Ala Arg- Leu Ala Trp Ala Met Asp Phe Ala Asp Trp Pro Gin 

405 410 ^ * 415 

Leu Thr Gly Ala Leu Thr Ala His Met Thr Asn Val Arg Arg Val Phe 

420 425 430 

Asn Glu Leu He Gly Asp Asp Glu Ser Glu Thr Gin Glu Glu Ser Leu 

435 440 445 

Ser Glu Gin Trp Arg Glu Leu Trp Gin Asp Ala Leu Gin Glu Asp Asp 

450 455 460 

Thr Thr Pro Val. Leu Ala His Leu Ser Glu Asp Asp Arg Lys Gin Val 
4 65 4 70 4 75 . 4 80 

Leu Thr Leu He Ala Asp Phe Arg Lys Glu Leu Asp Lys Arg Thr lie 

485 490 " ' ~ 495 

Gly Pro Arg Gly Arg Gin Val Leu Asp His Leu Met Pro His Leu Leu 

500 505 510 

Ser Asp Val Cys Ala Arg Glu Asp Ala Ala Val Thr Leu Ser Arg lie 

515 520 525 

Thr Ala Leu Leu Val Gly lie Val Thr Arg Thr Thr Tyr Leu Glu Leu 

530 535 540 

Leu Ser Glu Phe Pro Ala Ala Leu Lys His Leu lie Ser Leu Cys Ala 
545 550 555 560 

Ala Ser Pro Met He Ala Ser Gin Leu Ala Arg Tyr Pro Leu Leu Leu 

565 570 ~ 575 

Asp Glu Leu Leu Asp Pro Asn Thr Leu Tyr Gin Pro Thr Ala Thr Asp 

580 585 590 

Ala Tyr Arg Asp Glu Leu Arg Gin Tyr Leu Leu Arg Val Pro Glu Asp 

595 600 . 605 

Asp Glu Glu Gin Gin Leu Glu Ala Leu Arg Gin Phe Lys Gin Ala Gin 
610 615 620 

-Leu Leu Arg lie Ala Ala Ala Asp lie Ala Gly Thr Leu. Pro Val Met 
625 630 635 640 

Lys . Val Ser Asp His Leu Thr Trp Leu Ala Glu Ala Met lie Asp. Ala 

645 * 650 655 

Val Val Gin Gin Ala Trp Val Gin. Met Val Ala Arg^ Tyr Gly Lys Pro 

660 665 . . " ~ 670 

Asn His Leu Asn Glu Arg Glu Gly Arg Gly Phe Ala Val Val Gly Tyr 

675 680 " 685 

Gly Lys Leu Gly Gly Trp Glu Leu Gly Tyr Ser Ser Asp Leu Asp Leu 

690 695 700 

lie Phe Leu His Asp Cys . Pro Met Asp Ala Met Thr Asp Gly Glu Arg 
"705 710 715 720 

Glu lie Asp Gly Arg Gin Phe Tyr Leu Arg. Leu Ala Gin Arg lie Met 

725 730 735 

His Leu Phe Ser Thr Arg Thr Ser Ser Gly He Leu Tyr Glu Val Asp 

740 745 ' 750 

Ala Arg Leu Arg Pro Ser Gly . Ala Ala Gly Met Leu Val Thr Ser Ala 

755 760 765 

Glu Ala Phe Ala Asp Tyr Gin Lys Asn Glu Ala Trp Thr Trp Glu His 

770 775 780 

Gin Ala Leu Val Arg Ala Arg Val Val Tyr Gly Asp Pro Gin Leu Thr 
7 85 790 795 800 

Ala His Phe Asp Ala Val Arg Arg Glu. lie Met Thr Leu Pro Arg Glu 
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<210> 327 
<211> 433 
<212> PRT 
<213> E. Coli 
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<210> 328 
<211> 70 
<212> PRT 
<213> E. Coli 

<400> 328 

Met Ser Gly Lys Met Thr Gly He Val Lys Trp Phe Asn Ala Asp Lys 

1 5 10 15 

Gly Phe Gly Phe He Thr Pro Asp Asp Gly Ser Lys Asp Val Phe Val 

20 25 30 

His Phe Ser Ala He Gin Asn Asp Gly Tyr Lys Ser Leu Asp Glu Gly 

35 4 0 4 5 

Gin Lys Val Ser Phe Thr lie Glu Ser Gly Ala Lys Gly Pro Ala Ala 

50 5 5 60 

Gly Asn Val Thr Ser Leu 
65 70 



<210> 329 
<211> 523 
<212> PRT 
<213> E. Coli 

<400> 329 

Met Arg Asp lie Val Asp Pro Val Phe Ser lie Gly lie Ser Ser Leu 

1 5 10 " 15 

Trp Asp Glu Leu Arg His Met Pro Ala Gly Gly Val Trp Trp Phe Asn 

20 25 1 30 

Val Asp Arg His Glu Asp Ala lie Ser Leu Ala Asn Gin Thr lie Ala 

35 40 45 

Ser Gin Ala Glu Thr Ala His Val Ala Val lie Ser Met Asp Ser Asp 

50 5 5 60 

Pro Ala Lys lie Phe Gin Leu Asp Asp Ser Gin Gly Pro Glu Lys lie 
65 70 75 80 
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<A00> 330 



• Met Thr He Ser Asp lie lie Glu lie He Val Val Cys Ala Leu He 
1 5 10 15 

Phe Phe Pro Leu Gly Tyr Leu Ala Arg His Ser Leu Arg Arg He Arg 

20 25 30 

Asp Thr Leu Arg Leu Phe Phe Ala Lys Pro Arg Tyr Val Lys Pro Ala 
35 4 0 4 5 

■ Gly Thr Leu Arg Arg Thr Glu Lys Ala Arg Ala Thr Lys Lys 



<210> 331 

<211> 559 

<212> PRT 

<213> E . Coli 

<400> 331 

Met Thr Gin Phe Thr Gin Asn Thr Ala Met' Pro Ser Ser Leu Trp Gin 

15 10 15 

Tyr Trp Arg Gly Leu Ser Gly Trp Asn Phe Tyr Phe Leu Val Lys Phe 

20 25 30 

Gly Leu Leu Trp Ala Gly Tyr Leu Asn Phe His Pro Leu Leu Asn Leu 

35 40 45 

Val Phe Ala Ala . Phe Leu Leu Met Pro Leu Pro Arg Tyr Ser Leu His 

50 55 60 

Arg Leu Arg His Trp lie Ala Leu Pro He Gly Phe Ala Leu Phe Trp 
65 70 75 80 

His Asp Thr Trp Leu Pro Gly Pro Glu Ser He Met Ser Gin Gly Ser 

85 90 95 

Gin Val Ala Gly Phe Ser Thr Asp Tyr Leu He Asp Leu Val Thr Arg 

100 105 110 

Phe He Asn Trp Gin Met He Gly Ala He Phe Val Leu Leu Val Ala 

115 120 125 

Trp Leu Phe Leu Ser Gin Trp He Arg He Thr Val Phe Val Val Ala 

130 135 140 

He Leu Leu Trp Leu Asn Val Leu Thr Leu Ala Gly Pro Ser Phe Ser 
145 150 155 160 

Leu Trp Pro Ala Gly Gin Pro Thr Thr Thr Val Thr Thr Thr Gly Gly 

165 170 175 

Asn Ala Ala Ala Thr Val Ala Ala Thr Gly Gly Ala Pro Val Val Gly 

180 185 " 190 

Asp Met Pro Ala Gin Thr Ala Pro Pro Thr Thr Ala Asn Leu Asn Ala 

195 200 205 

Trp Leu Asn Asn Phe Tyr Asn Ala Glu Ala Lys Arg Lys Ser Thr Phe 

210 215 " 220 

Pro Ser Ser Leu Pro Ala Asp Ala Gin Pro Phe Glu Leu Leu Val lie 
225 230 . 235 240 

Asn lie Cys Ser Leu Ser Trp Ser Asp lie Glu Ala Ala Gly Leu Met 

245 250 255 

Ser His Pro Leu Trp Ser His Phe Asp lie Glu Phe Lys Asn Phe Asn 

260 265 270 

Ser Ala Thr Ser Tyr Ser Gly Pro Ala Ala lie Arg Leu Leu Arg Ala 

275 280 285 

Ser Cys Gly Gin Thr Ser His Thr Asn Leu Tyr Gin Pro Ala Asn Asn 

290 295 300 

Asp Cys Tyr Leu Phe Asp Asn Leu Ser Lvs Leu Gly Phe Thr Gin His 
305 ' 310 315 " • 320 

Leu Met Met Gly His Asn Gly Gin Phe Gly Gly Phe Leu Lys Glu Val 
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<210> 332 
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<212> PRT 
<213> E . Coli 



<400> 332 
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<210>333 
<211> 101 
<212> PRT 
<213> E. Coli 

<400> 333 

Met Ser Lys Glu His Thr Thr Glu His Leu Arg Ala Glu Leu Lys Ser 



-158- 



BNSDOCID: <WO 004490SA2_I_> 



WO 00/44906 




PCT/USOO/02200 , 



1 








5 










10 










15 




Leu 


Ser 


Asp 


Thr 


Leu 


Glu 


Glu 


Val 


Leu 


Ser 


Ser 


Ser 


Glv 


Glu 


Lys 


Ser 








20 










25 








30 




Lys 


Glu 


Glu 


Leu 


Ser 


Lys 


He 


Arg 


Ser 


Lys 


Ala 


Glu 


Gin 


Ala 


Leu 


Lvs 






35 










40 










45 






Gin 


Ser 


Arg 


Tyr 


Arg 


Leu 


Gly 


Glu 


Thr 


Gly 


Asp 


Ala 


He 


Ala 


L vs 


Gin 




50 










55 










60 








Thr 


Arq 


Val 


Ala 


Ala 


Ala 


Arq 


Ala 


Asd 


Glu 


Tvr 
j 


Val 


Arg 


Glu 


Asn 


Pro 


65 










70 










75 










80 


Trp 


Thr 


Gly 


Val 


Gly 
85 


He 


Gly 


Ala 


Ala 


He 
90 


Gly 


Val 


Val 


Leu 


Gly 
95 


Val 


Leu 


Leu 


Ser 


Arg 
100 


Arg 

























<210> 334 
<211> 134 
<212> PRT 
<213> E. Coli 





<400> 


334 


























Met 


Ala 


Asp 


Thr 


His 


His 


Ala 


Gin 


Gly 


Pro 


Gly 


Lys 


Ser 


Val 


Leu 


Gly 


1 








5 










10 










15 


He 


Gly 


Gin 


Arg 


He 


Val 


Ser 


He 


Met 


Val 


Glu 


Met 


Val 


Glu 


Thr 


Arg 








20 










25 










30 




Leu 


Arg 


Leu 
35 


Ala 


Val 


Val 


Glu 


Leu 
40 


Glu 


Glu 


Glu 


Lys 


Ala 
45 


Asn 


Leu 


Phe 


Gin 


Leu 


Leu 


Leu 


Met 


Leu 


Gly 


Leu 


Thr 


Met 


Leu 


Phe 


Ala 


Ala 


Phe 


Gly 




50 










55 










60 








Leu 


Met 


Ser 


Leu 


Met 


Val 


Leu 


He 


He 


Trp 


Ala 


Val 


Asp 


Pro 


Gin 


Tyr 


65 










70 










75 










80 


Arg 


Leu 


Asn 


Ala 


Met. 
85 


lie 


Ala 


Thr 


Thr 


Val 
90 


Val 


Leu 


Leu 


Leu 


Leu 
95 


Ala 


Leu 


He 


Gly 


Gly 
100 


He 


Trp 


Thr 


Leu 


Arg 
105 


Lys 


Ser 


Arg 


Lys 


Ser 
110 


Thr 


Leu 


Leu 


Arg 


His 
115 


Thr 


Arg 


His 


Glu 


Leu 
120 


Ala 


Asn 


Asp 


Arg 


Gin 
125 


Leu 


Leu 


Glu 


Glu 


Glu 
130 


Ser 


Arg 


Glu 


Gin 























<210> 335 
<211> 99 
<212> PRT 
<213> E . Coli 



<400> 335 

Met Ser Ser Lys Val Glu Arg Glu Arg Arg Lys Ala Gin Leu Leu Ser 

1 5 " ~ 10 " 15 

Gin He Gin Gin Gin Arg Leu Asp Leu Ser Ala Ser Arg Arg Glu Trp 

20 25 30 

Leu Glu Thr Thr Gly Ala Tyr Asp Arg Arg Trp Asn Met Leu Leu Ser 

35 4 0 4 5 

Leu Arg Ser Trp Ala Leu Val Gly Ser Ser Val Met Ala He Trp Thr 

50 55 60 

He Arg His Pro Asn Met Leu Val Arg Trp Ala Arg Arg Gly Phe Gly 
65 70 75 80 

Val Trp Ser Ala Trp Arg Leu Val Lys Thr Thr Leu Lys Gin Gin Gin 
85 90 " 95 

Leu Arg Gly 
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<210> 336 
<211> 160 
<212> PRT 
<213> E . Coli 

<400> 336 

Met lie Leu Ser lie Asp Ser Asn Asp Ala Asn Thr Ala Pro Leu His 

1 5 10 ' -15 

Lys Lys Thr lie Ser Ser Leu Ser Gly Ala Val Glu Ser Met Met Lys 

20 25 30 

Lys Leu Glu Asp Val Gly Val Leu Val Ala Arg lie Leu Met Pro lie 

35 40 45 

Leu Phe lie Thr Ala Gly Trp Gly Lys lie Thr Gly Tyr Ala Gly Thr 

50 5*5 60 

Gin Gin Tyr Met Glu Ala Met Gly Val Pro Gly Phe Met Leu Pro Leu 
65 70 75 80 

Val lie Leu Leu Glu Phe Gly Gly Gly Leu Ala lie Leu Phe Gly Phe 

85 90 95 

Leu Thr Arg Thr Thr Ala Leu Phe Thr Ala Gly Phe Thr Leu Leu Thr 

100 105 110 

Ala Phe Leu Phe His Ser Asn Phe Ala Glu Gly Val Asn Ser Leu Met 

115 120 " 125 

Phe Met Lys Asn Leu Thr lie Ser Gly Gly Phe Leu Leu Leu Ala lie 

130 135 140 

Thr Gly Pro Gly Ala Tyr Ser lie Asp Arg Leu Leu Asn Lys Lys Trp 
145 150 . 155 " ^ 1€0 
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Gly His Leu He Gly Asp Val Val Leu Arg Thr Leu Ala Thr Tyr Leu 

180 185 190 

Ala Ser Trp Thr Arg Asp Tyr Glu Thr Val Tyr Arg Tyr Gly Gly Glu 

195 200 205 

Glu Phe He He He Val Lys Ala Ala Asn Asp Glu Glu Ala Cys Arg 

210 215 220 

Ala Gly Val Arg He Cys Gin Leu Val Asp Asn His Ala He Thr His 
225 230 235 240 

Ser Glu Gly His lie Asn He Thr Val Thr Ala Gly Val Ser Arg Ala 

245 250 - 255 

Phe Pro Glu Glu Pro Leu Asp Val Val lie Gly Arg Ala Asp Arg Ala 

260 265 270 

Met Tyr Glu Gly Lys Gin Thr Gly Arg Asn Arg Cys Met Phe lie Asp 

275 280 285 

Glu Gin Asn Val lie Asn Arg Val 
290 295 



<210> 338 
<211> 203 
<212> PRT 
<213> E. Coli 

<400> 338 

Met Arg Leu Arg Val Val Pro Gly Phe lie Ser Pro Pro Pro Gly Phe 

1 5 -10 is 

Gly Gly Leu Gly Tyr Thr Pro Thr Ala Arg Ala Cys Val Asn He Ser 

20 25 30 

lie Pro Leu Gin Leu Arg Val lie Asp Met Leu Asp Val Phe Thr Pro 

35 40 45 

Leu Leu Lys Leu Phe Ala Asn Glu Pro Leu Glu Arg Leu Met Tyr Thr 

50 55 60 

lie lie lie Phe Gly Leu Thr Leu Trp Leu lie Pro Lys Glu Phe Thr 
65 70 75 - 80 

Val Ala Phe Asn Ala Tyr Thr Glu lie Pro Trp Leu Phe Gin He lie 

85 90 95 

Val Phe Ala Phe Ser Phe Val Val Ala lie Ser Phe Ser Arg Leu Arg 

100 105 no 

Ala His lie Gin Lys His Tyr Ser Leu Leu Pro Glu Gin Arg Val Leu 

115 120 125 

Leu Arg Leu Ser Glu Lys Glu lie Ala Val Phe Lys Asp Phe Leu Lys 

130 135 140 

Thr Gly Asn Leu He- lie Thr Ser Pro Cys Arg Asn Pro Val Met Lys 
145 150 155 160 

Lys Leu Glu Arg Lys Gly lie lie Gin His Gin Ser Asp Ser Ala Asn 

165 170 175 

Cys Ser Tyr Tyr Leu Val Thr Glu Lys Tyr Ser His Phe Met Lys Leu 

180 185 190 

Phe Trp Asn Ser Arg Ser Arg- Arg Phe Asn Arg 
195 200 



<210> 339 

<211> 58 

<212> PRT 

<213> E. Coli 

<400> 339 

Met Leu Leu Gin Pro Ser Ala Arg Thr Ser Phe Gly Phe Lys Cys Phe 
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1 ^^5 10 is 

Ala- Phe Gly He Arg His Gly Ser Glu Arg Ser He Leu Val Gly Gl" 

20 25 30 

His Ala Ala His Gin Gly Phe Val Val Ala Glu Val Asp Phe Leu His 

35 40 45 

Phe Ala Asn Leu Thr Ser Cys Cys Tyr Val 
50 55 



<210> 340 
<211> 1426 
<212> PRT 
<213> E. Coli 

<400> 340 

Met Ser Gly Lys Pro Ala Ala Arg Gin Gly Asp Met Thr Gin Tyr Gly 

1 5 10 is 

Gly Pro He Val Gin Gly Ser Ala Gly Val Arg He Gly Ala Pro Thr 

20 25 30 

Gly Val Ala Cys Ser Val Cys Pro Gly Gly Met Thr Ser Gly Asn Pro 

35 4 0 4 5 

Val Asn Pro Leu Leu Gly Ala Lys Val Leu Pro Gly Glu Thr Asp Leu 

50 55 60 

Ala Leu Pro Gly Pro Leu Pro Phe lie Leu Ser Arg Thr Tyr Ser Ser 
65 70 75 ~ " 80 

Tyr Arg Thr Lys Thr Pro Ala Pro Val Gly Val Phe Gly Pro Gly Trp 

85 90 " 95 

Lys Ala Pro Ser Asp lie Arg Leu Gin Leu Arg Asp Asp Gly Leu He 

100 105 no 

Leu Asn Asp Asn Gly Gly Arg Ser He His Phe Glu Pro Leu Leu Pro 

115 120 125 

Gly Glu Ala Val Tyr. Ser Arg Ser -Glu Ser Met Trp Leu Val Arg Gly 

130 135 140 

Gly Lys Ala Ala Gin Pro Asp Gly His Thr Leu Ala Arg Leu Trp Gly 
14 5 150 155 ~ 160 

Ala Leu Pro Pro Asp He Arg Leu Ser Pro His Leu Tyr Leu Ala Thr 

165 170 " 175 . 

Asn Ser Ala Gin Gly Pro Trp Trp He Leu Gly Trp Ser Glu Arg Val 

. 180 185 190 

Pro Gly Ala Glu Asp Val Leu Pro Ala Pro Leu Pro Pro Tyr Arg Val 

195 . 200 . 205. 

Leu Thr Gly Met Ala Asp Arg Phe Gly Arg Thr Leu Thr Tyr Arg Arg 

210 215 220 

Glu Ala Ala Gly Asp Leu Ala Gly Glu lie Thr Gly Val Thr Asp Gly 
225 230 235 240 

Ala Gly Arg Glu Phe Arg Leu Val Leu Thr Thr Gin Ala Gin Arg Ala 

245 250 255 

Glu Glu Ala Arg Thr Ser Ser Leu Ser Ser Ser Asp Ser Ser. Arg Pro 

260 265 270 

Leu Ser Ala Ser Ala Phe Pro Asp Thr Leu Pro Gly Thr Glu Tyr Gly 

275 28 0 285 

Pro Asp Arg Gly He Arg Leu Ser Ala Val Trp Leu Met His Asp Pro 

290 295 300 

Ala Tyr Pro Glu Ser Leu Pro Ala Ala Pro Leu Val Arg Tyr Thr Tyr 
305 310 315 " ' 320 

Thr Glu Ala Gly Glu Leu Leu Ala Val Tyr Asp Arg Ser Asn Thr Gin 

325 330 " 335 

Val Arg Ala Phe Thr Tyr Asp Ala Gin His Pro Gly Arg Met Val Ala 

340 345 . 350 

His Arg Tyr Ala Gly Arg Pro Glu Met Arg Tyr Arg Tyr Asp Asp Thr 
355 360 365 
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Gly Arg Val VaX Glu Gin Leu Asn Pro Ala Gly Leu Ser Tyr Arq Tyr 

3"?0 375 380 

Leu Tyr Glu' Gin Asp Arg lie Thr Val Thr Asp Ser Leu Asn Arg Arg 
385 390 395 400 

Glu Val Leu His Thr Glu Gly Gly Ala Gly Leu Lys Arg Val Val Lys 

405 4 10 " 4 15 

Lys Glu Leu Ala Asp Gly Ser Val Thr Arg Ser Gly Tyr Asp Ala Ala 
420 425 v 430 

' Gly Arg Leu Thr Ala Gin Thr Asp Ala Ala Gly Arg Arg Thr Glu Tyr 
4 35 4 40 4 45 

Gly Leu Asn Val Val Ser Gly Asp He Thr Asp He Thr Thr Pro Asp 

450 ■ 455 460 

Gly Arg Glu Thr Lys Phe Tyr Tyr Asn Asp Gly Asn Gin Leu Thr Ala 
465 470 475 480 

Val Val Ser Pro Asp Gly Leu Glu Ser Arg Arg Glu Tyr Asp Glu Pro 

485 490 495 

Gly Arg Leu Val Ser Glu Thr Ser Arg Ser Gly Glu Thr Val Arg Tyr 

500 * 505 510 

Arg Tyr Asp Asp Ala His Ser Glu Leu Pro Ala Thr Thr Thr Asp Ala 

515 520 525 

Thr Gly Ser Thr Arg Gin Met Thr Trp Ser Arg Tyr Gly Gin Leu Leu 

530 535. 540 

Ala Phe Thr Asp Cys Ser Gly Tyr Gin Thr Arg Tyr Glu Tyr Asp Arg 
545 550 555 560 

Phe Gly Gin Met Thr Ala Val His Arg Glu Glu Gly He Ser Leu Tyr 

565 570 575 

Arg Arg Tyr Asp Asn Arg Gly Arg Leu Thr Ser Val Lys Asp Ala Gin 

580 585 590 

Gly Arg Glu Thr Arg Tyr Glu Tyr Asn Ala Ala Gly Asp Leu Thr Ala 

595 600 605 

Val He Thr Pro Asp Gly Asn Arg Ser Glu Thr Gin Tyr Asp Ala Trp 

610 615 620 

Gly Lys Ala Val Ser Thr Thr Gin Gly Gly Leu Thr Arg Ser Met Glu 
625 630 635 640 

Tyr Asp Ala Ala Gly Arg Val lie Ser Leu Thr Asn Glu Asn Gly Ser 

645 650 655 

His Ser Val Phe Ser Tyr Asp Ala Leu Asp Arg Leu Val Gin Gin Gly 

660 665 670 

Gly Phe Asp Gly Arg Thr Gin Arg Tyr His Tyr Asp Leu Thr Gly Lys 

675 . 680 L 685 

Leu Thr Gin Ser Glu Asp Glu Gly Leu Val lie Leu Trp Tyr Tyr Asp 

690 695 700 

Glu Ser Asp Arg lie Thr His Arg Thr Val Asn Gly Glu Pro Ala Glu 
705 710 715 720 

Gin Trp Gin Tyr Asp Gly His Gly Trp Leu Thr Asp lie Ser His Leu 

725 730 735 

Ser Glu Gly His Arg Val Ala Val His Tyr Gly Tyr Asp Asp Lys Gly 

740 745 ~ 750 

Arg Leu Thr Gly Glu Cys Gin Thr Val Glu Asn Pro Glu Thr Gly Glu 

755 760 765 

Leu Leu Trp Gin His Glu Thr Lys His Ala Tyr Asn Glu Gin Gly Leu 

770 775 780 

Ala Asn Arg. Val Thr Pro Asp Ser Leu Pro Pro Val Glu Trp Leu Thr 
785 790 795 * 800 

Tyr Gly Ser Gly Tyr Leu Ala Gly Met Lys Leu Gly Gly Thr Pro. Leu 

805 810 815 

Val Glu Tyr Thr Arg Asp Arg Leu His Arg Glu Thr Val Arg Ser Phe 

820 • 825 830 

Gly Ser Met Ala Gly Ser Asn Ala Ala Tyr Glu Leu Thr Ser Thr Tyr 

835 840 845 

Thr Pro Ala Gly Gin Leu Gin Ser Gin His Leu Asn Ser Leu Val Tyr 
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850 855 «860 

Asp Arg Asp Tyr Gly Trp Ser Asp Asn Gly Asp Leu Val Arg lie Ser 
86 5 870 875 880 

Gly Pro Arg Gin Thr Arg Glu Tyr Gly Tyr Ser Ala Thr Gly Arg Leu 

885 890 895 

Glu Ser Val Arg Thr Leu Ala Pro Asp Leu Asp lie Arg lie Pro Tyr 

900 905 910 

Ala Thr Asp Pro Ala Gly Asn Arg Leu Pro Asp Pro Glu Leu His Pro 

915 920 925 

Asp Ser Thr Leu Thr Val Trp Pro Asp Asn Arg He Ala Glu Asp Ala 

930 935 94 0 

His Tyr Val Tyr Arg His Asp Glu Tyr Gly Arg Leu Thr Glu Lys Thr 
94 5 950 955 960 

Asp Arg lie Pro Ala Gly Val He Arg Thr Asp Asp Glu Arg Thr His 

965 970 975 

His Tyr His Tyr Asp Ser Gin His Arg Leu Val Phe Tyr Thr Arg lie 

980 985 990 

Gin His Gly Glu Pro Leu Val Glu Ser Arg Tyr Leu Tyr Asp Pro Leu 

995 1000 1005 

Gly Arg Arg Met Ala Lys Arg Val Trp Arg Arg Glu Arg Asp Leu Thr 

1010 1015 1020 

Gly Trp Met Ser Leu Ser Arg Lys Pro Glu Val Thr Trp Tyr Gly Trp 
1025 1030 1035 " ' 1040 

Asp Gly Asp Arg Leu Thr Thr Val Gin Thr Asp Thr Thr Arg lie Gin 

1045 1050 1055 

Thr Val Tyr Glu Pro Gly Ser Phe Thr Pro Leu lie Arg Val Glu Thr 

1060 1065 . 1070 

Glu Asn Gly Glu Arg Glu Lys Ala Gin Arg Arg Ser Leu Ala Glu Thr 

1075 1080 1085 

Leu Gin Gin Glu Gly Ser Glu Asn Gly His Gly Val Val Phe Pro Ala 

1090 1095 HOO 

Glu Leu Val Arg Leu Leu Asp Arg Leu Glu Glu Glu lie Arg Ala Asp 
1105 1110 His H20 

Arg Val Ser Ser Glu Ser Arg Ala Trp Leu Ala Gin Cys Gly Leu Thr 

1125 1130 " H35 

Val Glu Gin Leu Ala Arg Gin Val Glu Pro Glu Tyr Thr Pro Ala Arg 

1140 1145 H50 

Lys Ala His Leu Tyr His Cys Asp His Arg Gly Leu Pro Leu Ala Leu 

1155 1160 H65 

He Ser. Glu Asp Gly Asn Thr Ala Trp Ser Ala Glu Tyr Asp Glu Trp 

1170 1175 H80 

Gly Asn Gin Leu Asn Glu Glu Asn Pro His His Val Tyr Gin Pro Tyr 
1185 , H90 H95 " 1200 

Arg Leu Pro Gly Gin Gin His Asp Glu Glu Ser Gly Leu Tyr Tyr Asn 

1205 1210 " 1215 

Arg His Arg Tyr Tyr Asp Pro Leu Gin Gly Arg Tyr lie Thr Gin Asp 

1220 1225 1230 

Pro Met Gly Leu Lys Gly Gly Trp Asn Leu Tyr Gin Tyr Pro Leu Asn 

1235 1240 1245 

Pro Leu Gin Gin He Asp Pro Met Gly Leu Leu Gin Thr Trp Asp Asp 

1250 1255 1260 

Ala Arg Ser Gly Ala Cys Thr Gly Gly Val Cys Gly Val Leu Ser Arg 
1265 1270 1275 1280 

He He Gly Pro Ser Lys Phe Asp Ser Thr Ala Asp Ala Ala Leu Asp 

1285 1290 1295 

Ala Leu Lys Glu Thr Gin Asn Arg Ser Leu Cys Asn Asp Met Glu Tyr 

1300 1305 1310 

Ser Gly He Val Cys Lys Asp Thr Asn Gly Lys Tyr Phe Ala Ser Lys 

1315 1320 1325 

Ala Glu Thr Asp Asn Leu Arg Lys Glu Ser Tyr Pro Leu Lvs Arg Lys 
1330 1335 1340 
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Cys Pro Thr Gly Thr Asp Arg Val Ala Ala Tyr His Thr His Gly Ala 
1345 1350 1355 1360 

Asp Ser His Gly Asp Tyr Val Asp Glu Phe Phe Ser Ser Ser Asp Lys 

1365 1370 1375 

Asn Leu Val Arg Ser Lys Asp Asn Asn Leu Glu Ala Phe Tyr Leu Ala 

1380 1385 1390 

Thr Pro Asp Gly Arg Phe Glu Ala Leu Asn Asn Lys Gly Glu Tyr lie 

1395 1400 1405 

Phe lie Arg Asn Ser Val Pro Gly Leu Ser Ser Val Cys He Pro Tyr 

1410 1415 1420 

His Asp 
1425 



<210> 341 
<211> 122 
<212> PRT 
<213> E . Coli 



<400> 341 
Met Lys Tyr Ser 
1 

Ala Cys Asn Glu 
20 

Met Arg Tyr Val 
35 

Thr Val Lys Thr 
50 

Arg Tyr Arg Tyr 
65 

He Thr Glu Leu 

Tyr Glu Val Ala 
100 

Gly Cys Val Leu 
115 



Ser He Phe Ser 
5 

Thr Ala Val Tyr 

Glu Lys Leu His 
40 

Glu Thr Met Ala 
55 

Gly Glu Arg He 
70 

Pro Asp Ser Trp 

85 

Gly Gly Val Phe 

Asn Phe Leu His 
120 



Met Leu Ser Phe 
10 

Gly Ser Asp Glu 
25 

Leu Asp Lys Tyr 

He Gin Leu Ala 
60 

Ala Glu Glu Glu 
75 

Val Val Glu Gly 
90 

He He Glu lie 
105 

Ser . Lys 



Phe lie Leu Phe 
15 

Asn He He Phe 
30 

Ser Val Lys Asn 
4 5 

Glu He Tyr Val 

Lys Pro Tyr Leu 
80 

Ala Lys Leu Pro . 
95 

Asn Lys Lys Asn 
110 



<210> 342 
<211> 236 
<212> PRT 
<213> E . Coli 
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65 
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95 
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Asp 


Val 


Ala 


Leu 


He 


Arg 
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His 
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Arg 








100 










105 










110 
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115 










120 










125 






Leu 


Gly 


Gly 


Thr 


Asp 


Gin 


Phe 


Phe 


His 


Cys 


Met 


Ala 


Phe 


Cys 


Arg 


Val 



-165- 



BNSDOCID: <WO 0044906A2_I_> 



WO 00/44906 

130 




PCT/US00/02200 



Ser Lys Leu Asn Asp Ala Gly Val Ser Arg Ser Ala Lys Gly Leu Gly 

145 150 155 ~ 160 

Tyr Glu Lys Glu lie Arg Asp Tyr Gly Leu Asn Leu Phe Gly Met Tyr 

165 170 175 

Gly Arg Lys Val Lys Leu Ser His Ser Glu Met He Glu Asp Asn Lys 

180 185 190 

Lys Asp Leu Ala Val Asn Asp His Gly Leu Thr Cys Pro Ser Thr Thr 

195 200 205 

Asp Cys Ser Asp Arg Cys Ser Asp Tyr He Asn Pro Glu His Lys Lys 

210 215 220 

Thr He Lys Ala Leu Gin Asp Ala Gly Tyr Leu Lys 

225 230 235 



<210> 343 
<21i> 86 
<212> PRT 
<213> E. Coli 





<400> 


343 
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He 
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lie 


Phe 


1 








5 










10 










15 




Ala 


He 


He 


lie 


He 


Val 


Val 


Leu 


Cys 


Val 


He 


Thr 


Tyr 


Leu 


Tyr 


Leu 








20 










25 








30 




Tyr 


Lys 


Asp 


Glu 


Ser 


Leu 


Val 


Ser 


Lys 


His 


Tyr 


lie 


Asn 


Tyr 


Met 


Ala 






35 










40 










4 5 






He 


Pro 


Glu 


Asn 


Asp 


Gly 


Val 


Phe 


Thr 


Trp 


Leu 


Pro 


Asp 


Phe 


Phe 


Pro 




50 










55 










60 








His. 


Val 


Ala 


Val 


Asp 


He 


Ser 


He 


Tyr 


Thr 


Asn 


Val 


Glu 


Asp 


Asp 


Tyr 


65 










70 










75 






80 
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Phe 


Leu 


He 


Phe 


Pro 























85 



<210> 344 

<211> 63 

<212> PRT 

<213> E. Coli 

<400> 344 

Met Arg Ala Arg Glu Gin Val Ala Lys lie Val Ser Lys Asn Asp Pro 

1 5. .10 15 

Asp Thr Lys Lys Val Trp Cys Lys Tyr Gly Lys lie Pro Gly Gin Gly 

20 25 30 

Asp Gly Val Asn Leu Phe Phe Val Gly Glu lie Asn Val Thr His Tyr 

35 4 0 45 - J 

Phe lie Thr Asn lie Gly Ala Gly Leu Pro Asp Ala Cys Ala Glu 
'50 55 60 



<210> 345 
<211> 167 
<212> PRT 
<213> E. Coli 



' <400> 345 

Met Pro Gly Asn Ser Pro His Tyr Gly Arg Trp Pro Gin His Asp Phe 
15 10 15 
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Thr Ser Leu Lys Lys Leu 
20 

Pro Gly Ser Asp Val lie 
35 

Tyr Val Gly Ala Lys Ser 
50 

Ser Leu' Arg Lys Thr Val 
65 70 
Ala Thr Leu Gly Arg. Leu 
85 

lie Trp Met Thr Asp Gly 
100 

Lys Leu His Val lie Ser 
■ 115 

Asn Leu Asn Leu Arg Gin 
130 

Ser Phe Ser Lys Ser Val 
145 150 
Leu Asn lie Lys His Tyr 
165 



Arg Pro Gin Ser Val 
25 

Val' Cys Ala Glu Met 
40 

Arg Gin Arg Trp Leu 
55 

Val Ala His Val Phe 
75 

Met Ser Leu Leu Ser 
90 

Trp Pro Leu Tyr Glu 
105 

Lys Arg Tyr Thr Gin 
120 

His Leu Ala Arg Leu 
135 

Glu Leu His Asp Lys 
155 

Gin 




PCT/US00/02200 ,. 



Thr Ser Arg lie Gin 
30 

Asp Glu Gin Trp Gly 
45 

Phe Tyr Ala Tyr Asp 
60 

Gly Glu Arg Thr Met 
80 

Pro Phe Asp Val Val 

• . 95 

Ser Arg Leu Lys Gly 
110 

Arg lie Glu Arg His 
125 

Gly Arg Lys Ser Leu 
140 

Val lie Gly His Tyr 
160 



<210> 346 

<211> 91 

<212> PRT 

<213> E. Coli 

<400> 346 

Met Ala Ser Val Ser lie Ser Gys Pro Ser Cys Ser Ala Thr Asp Gly 

1 "5 10 15 

Val Val Arg Asn Gly Lys Ser Thr Ala Gly His Gin Arg Tyr Leu Cys 

20 25 30 

Ser His Cys Arg Lys Thr Trp Gin Leu Gin Phe Thr Tyr Thr Ala Ser 

35 40 45 

Gin Pro Gly Thr His Gin Lys He He Asp Met Ala Met Asn Gly Val 

50 55 60 

Gly Cys Arg Ala Thr Ala Arg He Met Gly Val Gly Leu Asn Thr lie 
65 70 -75 80 

Leu Arg His Leu Lys Asn Ser Gly Arg Ser Arg 
85 90 



<210> 347 

<211> 138 

<212> PRT 

<213> E.. Coli 





<400> 


347 


























Met 


Met 


Thr 


Lys 


Thr 


Gin 


He 


Asn 


Lys 


Leu 


He 


Lys 


Met 


Met 


Asn 


Asp 


1 








5 










10 










15 


Leu 


Asp 


Tyr 


Pro 


Phe 


Glu 


Ala 


Pro 


Leu 


Lys 


Glu 


Ser 


Phe 


He 


Glu 


Ser 








20 










25 










30 






He. 


He 


Gin 


He 


Glu 


Phe 


Asn 


Ser 


Asn 


Ser 


Thr 


Asn 


Cys 


Leu 


Glu 


Lys 






35 










40 










45 






Leu 


Cys 


Asn 


Glu 


Val 


Ser 


He 


Leu 


Phe 


Lys 


Asn 


Gin 


Pro 


Asp 


Tyr 


Leu 




50 










55 










60 






Thr 


Phe 


Leu 


Arg 


Ala 


Met 


Asp 


Gly 


Phe 


Glu 


Val 


Asn 


Gly 


Leu 


Arg 


Leu 


65 










70 










7 5 






80 


Phe 


Ser 


Leu 


Ser 


He 


Pro 


Glu 


Pro 


Ser 


Val 


Lys 


Asn 


Leu 


Phe 


Ala 


Val 



85 90 95 
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Asn Glu Phe Tyr Arg Asn Asn Asp Asp Phe lie Asn Pro Asp Leu Gin 

100 105 no 

Glu Arg Leu Val He Gly Asp Tyr Ser He Ser He Phe Thr Tyr Asp 

115 120 125 

He Lys Gly Asp Ala Ala Asn Leu Leu He 
130 135 



<210> 348 

<211> 392 

<212> PRT 

<213> E . Coli • 

<400> 348 

Met Ser Asn He Val Tyr Leu Thr Val Thr Gly Glu Gin Gin Gly Ser 

15 10 15 

He Ser Ala Gly Cys Gly Thr Ser Glu Ser Thr Gly Asn Arg. Trp Gin 

20 25 30 

Ser Gly His Glu Asp Glu He Phe Thr Phe Ser Leu Leu Asn Asn lie 

35 40 45 

Asn Asn Thr Gly Leu Gly Ser Gin Phe His Gly He Thr Phe Cys Lvs 

50 55 60 

Leu lie Asp Lys Ser Thr Pro Leu Phe lie Asn Ser lie Asn Asn Asn 
65 70 75 80 

Glu Gin Leu Phe Met Gly Phe Asp Phe Tyr Arg lie Asn Arg Phe Gly 

85 90 95 

Arg Leu Glu Lys Tyr Tyr Tyr lie Gin Leu Arg Gly Ala Phe Leu Ser 

100 105 no 

Ala lie His His Gin lie lie Glu Asn Gin Leu Asp Thr Glu Thr lie 

115 120 - 125 

Thr lie Ser Tyr Glu Phe He Leu Cys Gin His Leu lie Ala Asn Thr 

130 135 140 

Glu Phe Ser Tyr Leu Ala Leu Pro Glu Asn Tyr Asn Arg Leu Phe Leu 
145 150 155 ' 160 

Pro Asn Ser Lys Asn Gin Thr Asn Asn Arg Phe Lys Thr Leu Asn Ser 

165 170 J 175 

Lys Ala lie Gly Arg Leu Leu Ala Ala Gly Gly Val Tyr Asn Gly Asn 

180 185 " 190 

lie Glu Gly Phe Arg Asp Thr Ala Glu Lys Leu Gly Gly Asp Ala lie 

195 200 205 

Lys Gly Tyr Asp Gin He Leu Asn Glu Lys Thr Ala Gly lie Ala lie 

210 215 220 

Ala Thr Ala Ser lie Leu Leu Thr Lys Arg Ser Asn Val Asp Thr Tyr • 
225 230 235 . ' 240 

Thr Glu lie Asn Ser Tyr Leu Gly Lys Leu Arg Gly Gin Gin Lys Leu 

245 250 255 

Leu Asp Gly lie Asp He lie Glu lie lie Tyr lie Lys Arg ' Pro Ser 

260 265 ~ 270 

Lys Asp Leu Ala Asn Leu Arg Lys Glu Phe Asn Lys Thr Val Arg Lys 

275 280 285 

Asn Phe Leu lie Lys Leu Ala Lys Thr Ser Glu Ala Ser Gly Arg Phe 

290 295 300 

Asn Ala Glu Asp Leu Leu Arg Met Arg Lys Gly Asn Val Pro Leu Asn 
305 310 315 320 

Tyr Asn Val His His Lys Leu Ser Leu Asp Asp Gly Gly Thr Asn Asp 

325 330 ~ 335 

Phe Glu Asn Leu Val Leu lie Glu Asn Glu Pro Tyr His Lys Val Phe 

340 345 350 

Thr Asn Met Gin Ser Arg lie Ala Lys Gly lie Leu Val Gly Glu Ser 

355 360 365 

Lys lie Thr Pro Trp Ala lie Pro Ser Gly Ser lie Tyr Pro Pro Met 
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370 










375 






Lys 


Asn 


He 


Met 


Asp 


His 


Thr 


Lys 




385 










390 










<210> 


34 9 














<211> 


221 














<212> 


PRT 














<213> 


E . 


~oli 












<400> 


34 9 












Met 


Val 


Leu 


Ala 


Leu 


Asn 


Tyr 


Asn 


Met 


1 








. 5 










Glu 


Asn 


Ala 


Ala 


Lys 


Pro 


His 


Thr 


Met 








20 










25 


Tyr 


He 


Arg 


Ser 


He 


Glu 


Lys 


Asn 


Gly 






35 










40 




Gly 


Lys 


Leu 


Arg 


Tyr 


Ser 


Asp 


Glu 


Leu 




50 










55 






Thr 


Phe 


Leu 


Asp 


Ser 


Lys 


Arg 


Gin 


Leu 


65 










70 








Gly 


He 


Lys 


Thr 


Lys 


He 


He 


Asp 


Tyr 










85 










Ala 


Asn 


Thr 


Val 


Ala 


Phe 


Glu 


Asp 


Val 








100 










105 


Asp 


Phe 


He 


Leu 


Cys 


Ser 


Asn 


Val 


Leu 






115 










120 




Thr 


He 


Asp 


Lys 


He 


Val 


Leu 


Ser 


He 




130 










135 






Gly 


Glu 


Thr 


Leu 


lie 


Val 


Asn 


Gin 


Tyr 


145 










150 








Tyr 


Glu 


Thr 


Gly 


Arg 


Lys 


His 


Leu 


Tyr 










165 










Lys 


Ser 


Val 


Ser 


Tyr 


Tyr 


Gly 


Leu 


Leu 








180 










185 


He 


Cys 


Ser 


Ser 


His 


Gly 


Leu 


Glu 


He 






195 










200 




Gly 


Ser 


Ser 


Tyr 


Val 


Thr 


Val 


Gly 


Ser 




210 










215 








<210> 


350 














<211> 


234 














<212> 


PRT 














<213> 


E . Coli 












<400> 


350 












Met 


Asn 


Asn 


Met 


Phe 


Glu 


Pro 


Pro 


Lys 


1 








5 










Lys 


Leu 


His 


Lys 


Ala 


Thr 


Phe 


Leu 


Asn 








20 










25 


Leu 


Val 


He 


Tyr 


Glu 


Tyr 


He 


Pro 


Leu 






35 










40 




Val 


Pro 


Pro 


He 


Lys 


Asp 


His 


Glu 


Ser 




50 










c c 
DO 






Phe 


Gly 


He 


Leu 


Pro 


Cys 


Ala 


Phe 


Ala 


65 










70 








Gly' 


Ala 


Leu 


Asp 


Leu 


His 


Asn 


Asn 


Ala 










85 










Tyr 


Leu 


Trp 


Asp 


Lys 


His 


Leu 


He 


He 
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380 



His 


Gly 


Val 


Asn 


He 


Arg 


Ser 


10 










15 




Pro 


Ser 


Arg 


Tyr 


Leu 


Cys 


Glu 










30 






His 


Ala 


Leu 


Asp 


Phe 


Gly 


Cys 








45 








He 


Ser 


Lys 


Phe 


Asp 


Glu 


Val 






60 










Glu 


Arg 


Glu 


Gin 


lie 


lie 


Arg 




75 










80 


Val 


Pro 


Arg 


Tyr 


Tyr 


Lys 


Asn 


90 










95 




Asp 


Lys 


lie 


lie 


Gly 


Gly 


Tyr 










110 






Ser 


Ala 


Val 


Pro 


Cys 


Arg 


Asp 








125 








Lys 


Arg 


Leu 


Leu 


Lys 


Ser 


Gly 






140 










Lys 


Ser 


Ser 


Tyr 


Phe 


Lys 


Lys 




155 










160 


Gly 


Tyr 


lie 


Tyr 


Lys 


Asn 


Ser 


170 










175 




Asp 


Glu 


Leu 


Ala 


Val 


Gin 


Glu 










190 
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Lys 


Ser 


Trp 


Ser 


Lys 


Ala 








205 








Cys 
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Ala 


lie 












220 
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Tyr 


Asn 


Glu 


Met 


Leu 
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10 










15 




Thr 


Leu 


lie 
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Cys 


lie 


Leu 










30 






He 


Thr 
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Pro 


Thr 
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Tyr 








45 
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Ala 
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Ser 






60 










He 


Phe 


Ala 
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Leu 


lie 


Ser 
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80 


Ala 
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Arg 


Val 


Arg 


90 










95 




Lys 


Pro 


Leu 


Ser 


Arg 


Arg 


Ala 
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100 










105 










110 






Gly 


Val 
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Arg 


Lys 


Leu 
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Lys 


Asp 


Glu 
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His 


Asn 


Val 


Met 


Ser 






115 










12 0 










125 








Asn 


Leu 


Tyr 


Tyr 


Pro 


Glu 


Val 


Arg 


Lys 


He 


Glu 


Asp 


Lys 


His 


Tyr 


He 




130 










135 










140 










Glu 


Leu 


Phe 


Trp 


Asn 


Lys 


Val 


Tyr 


Tyr 


Phe 


Trp 


He 


Phe 


Phe 


Glu 


Phe 


145 










150 










155 










160 


Ser 


lie 


lie 


Ala 


Leu 


He 


Ser 


Phe 


Leu 


He 


He 


Phe 


Phe 


Cys 


Lys 


Gin 










165 










170 










175 




Met 


Asp 


He 


Phe 


His 


Val 


Glu 


Gly 


Ser 


Leu 


Leu 


Ser 


Leu 


Phe 


Phe 


Phe 








180 










185 










190 






Val 


lie 


Leu 


Ser 


Phe 


Ser 


Val 


Ser 


Gly 


He 


He 


Phe . 


Ala 


Leu 


Thr 


Val 






195 










200 










205 








Lys' 


Pro 


Arg 


Thr 


Glu 


Ser 


Gin 


Val 


Gly 


Lys 


lie 


Pro 


Asp 


Asp 


Lys 


He 




210 










215 










220 










Lys 


Glu 


Phe 


Phe 


Thr 


Lys 


Asn 


Asn 


He 


Asn 














225 










230 























<210> 351 

<211> 94 

<212> PRT 

<213> E . Coli 

<400> 351 

Met Phe thr lie Asn Ala Glu Val Arg Lys Glu Gin Gly Lys Gly Ala 

1 "5 10 15 

Ser Arg Arg Leu Arg Ala Ala Asn Lys Phe Pro Ala He lie Tyr Gly 

20 25 30 

Gly Lys Glu Ala Pro Leu Ala lie Glu Leu Asp His Asp Lys Val Met 

35 40 45 

Asn Met Gin Ala Lys.-.Ala Glu Phe Tyr Ser Glu Val Leu Thr lie Val 

50 55 60 

Val Asp Gly Lys Glu lie Lys Val Lys Ala Gin Asp Val Gin Arg His 
65 70 75 80 

Pro Tyr Lys Pro Lys Leu Gin His lie Asp Phe Val Arg Ala 
85 90 



<210> 352 
<211> 658 
<212> PRT 
<213> E. Coli 





<400> 


352 
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Val 
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Phe 


Tyr 


Arg 
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His 


Trp 


Arg 


Asp 
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Asp 
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10 










15 




Val 
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lie 


Met 


Met 


Asn 


Leu 


Thr 


Thr 


Leu 


Thr 


His 


Arg 


Asp 


Ala 


Leu 








20 










25 










30 
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Ala 


Arg 


Phe 


Thr 


Ser 


Arg 


Glu 


Glu 


Ala 


He 


His 


Ala 


Leu 






35 










40 










45 








Thr 


Gin 


Arg 


Leu 


Ala 


Ala 


Leu 


Gly 


Lys 


lie 


Ser 


Ser 


Thr 


Glu 


Gin 


Phe 




50 










55 










60 
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Glu 


Glu 


Val 
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Arg 


Arg 


Glu 


Ser 


Leu 


Gly 


Pro 
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Ala 
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Gly 


65 










70 










75 










80 


Glu 


Gly 
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Ala 


Val 


Pro 


His 


Gly 
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Thr 


Ala 


Ala 


Val 


Lys 


Glu 


Ala 










85 










90 










95 




Ala 
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Ala 


Val 


Ala 


Thr 


Leu 


Ser 


Glu 


Pro 


Leu 


Gin 


Trp 


Glu 


Gly 


Val 








100 










105 










110 




Asp 
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Glu 
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Val 


Asp 
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Val 


Val 


Leu 
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Ala 


lie 


Pro 


Pro 
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115 120 125 

Asn Glu Ala Gly Thr Thr His Met Gin Leu Leu Thr Ala Leu Thr Thr 

130 135 140 

Arg Leu Ala Asp Asp Glu He Arg Ala Arg He Gin Ser Ala Thr Thr 
145 150 155 leo 

Pro Asp Glu Leu Leu Ser Ala Leu Asp Asp Lys Gly Gly Thr Gin Pro 

165 170 ~ " 175 

Ser Ala Ser Phe Ser Asn Ala Pro Thr He Val Cys Val Thr Ala Cys 

180 185 " 190 

Pro Ala Gly He Ala His Thr Tyr Met Ala Ala Glu Tyr Leu Glu Lys 

195 200 205 

Ala Gly Arg Lys Leu Gly Val Asn Val Tyr Val Glu Lys Gin Gly Ala 

210 215 220 ' 

Asn Gly He Glu Gly Arg Leu Thr Ala Asp Gin Leu Asn Ser Ala Thr 
225 230 235 240 

Ala Cys He Phe Ala Ala Glu Val Ala He Lys Glu Ser Glu Arg Phe 

245 250 255 

Asn Gly lie Pro Ala Leu Ser Val Pro Val Ala Glu Pro He Arg His 

• 260 265 270 

Ala Glu Ala Leu' lie Gin Gin Ala Leu Thr Leu Lys Arg Ser Asp Glu 

275 280 285 

Thr Arg Thr Val Gin Gin Asp Thr Gin Pro Val Lys Ser Val Lys Thr 

290 295 300 

Glu Leu Lys Gin Ala Leu Leu Ser Gly He Ser Phe Ala Val Pro Leu 
305 310 315 320 

He Val Ala Gly Gly Thr Val Leu Ala Val Ala Val Leu Leu Ser Gin 

325 330 335 

He Phe Gly Leu Gin Asp Leu Phe Asn Glu Glu Asn Ser Trp Leu Trp 

340 345 350 

Met Tyr Arg Lys Leu Gly Gly Gly Leu Leu Gly lie Leu Met Val Pro 

355 360 365 

Val Leu Ala Ala Tyr- Thr Ala Tyr Ser Leu Ala Asp Lys Pro Ala Leu 

370 375 . 380 

Ala Pro Gly Phe Ala Ala Gly Leu Ala Ala Asn Met lie Gly Ser Gly 
385 390 395 4 00 

Phe Leu Gly Ala Val Val Gly Gly Leu lie Ala Gly Tyr Leu Met Arg 

4 05 410 415 

Trp Val Lys Asn His Leu Arg Leu Ser Ser Lys Phe Asn Gly Phe Leu 

420 425 " 430 

Thr Phe Tyr Leu Tyr Pro Val Leu Gly Thr Leu Gly Ala Gly Ser Leu 

435 440 445 

Met Leu Phe Val Val Gly Glu Pro Val Ala Trp lie Asn Asn Ser Leu 

450 455 . 460 

Thr Ala Trp Leu Asn Gly Leu Ser Gly Ser Asn Ala Leu Leu Leu Gly 
4^5 470 475 480 

Ala lie Leu Gly Phe Met Cys Ser Phe Asp Leu Gly Gly Pro Val Asn 

485 490 " ^ 495. 

Lys Ala Ala Tyr Ala Phe Cys Leu Gly Ala Met Ala Asn Gly Val Tyr 

500 505 510 

Gly Pro Tyr Ala lie Phe Ala Ser Val Lys Met Val Ser Ala Phe Thr 

515 520 525 

Val Thr Ala Ser Thr Met Leu Ala Pro Arg Leu Phe Lys Glu Phe Glu . 

530 535 540 

lie Glu Thr Gly Lys Ser Thr Trp Leu Leu Gly Leu Ala Gly lie Thr 
545 550 555 - 560 

Glu Gly Ala lie Pro Met Ala lie Glu Asp Pro Leu Arg Val lie Gly 

565 570 " 575 

Ser Phe Val Leu Gly Ser Met Val Thr Gly Ala lie Val Gly £la Met 

580 585 590 

Asn lie Gly Leu Ser Thr Pro Gly Ala Gly lie Phe Ser Leu Phe Leu 
595 600 6 05 
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Leu His Asp Asn Gly Ala Gly Gly Val Met Ala Ala He Gly Trp Phe 

610 615 620 

Gly Ala Ala Leu Val Gly Ala Ala He Ser Thr Ala He Leu Leu Met 
625 630 635 640 

Trp Arg Arg His Ala Val Lys His Gly Asn Tyr Leu Thr Asp Gly Val 
645 650 655 

Met Pro 



<210> 353 

<211> 877 

<212> PRT 

<213> E . Coli 

<400> 353 ' 

Met Lys Ala Val Ser Arg. Val His He Thr Pro His Met His Trp Asp 

1 ' 5 10 is 

Arg Glu Trp Tyr Phe Thr Thr Glu Glu Ser Arg He Leu Leu Val Asn 

20 25 30 

Asn Met Glu Glu He Leu Cys Arg Leu Glu Gin Asp Asn Glu Tyr Lys 

35 AO 45 

Tyr Tyr Val Leu Asp Gly Gin Thr Ala He Leu Glu Asp Tyr Phe Ala 

50 55 60 

•Val Lys Pro Glu Asn Lys Asp Arg Val Lys Lys Gin Val Glu Ala Gly 
65 70 75 80 

Lys Leu He He Gly Pro Trp Tyr Thr Gin Thr Asp Thr Thr lie Val 

85 90 95 

Ser Ala Glu Ser He Val Arg Asn Leu Met Tyr Gly. Met Arg Asp Cys 

100 105 no 

Leu Ala Phe Gly Glu Pro Met Lys He Gly Tyr Leu Pro Asp Ser Phe 

115 120 125 

Gly Met Ser Gly Gin Leu Pro His He Tyr Asn Gly Phe Gly He Thr 

130 135 140 

Arg Thr Met Phe Trp Arg Gly Cys Ser Glu Arg His Gly Thr Asp Lys 

150 155 160 

Thr Glu Phe Leu Trp Gin Ser Ser Asp Gly Ser Glu Val Thr Ala Gin 

165 170 175 

Val Leu Pro Leu Gly Tyr Ala lie Gly Lys Tyr Leu Pro Ala Asp Glu 

180 185 190 

Asn Gly Leu Arg Lys Arg Leu Asp Ser Tyr Phe Asp Val Leu Glu Lys 

195 200 205 

Ala Ser Val Thr Lys. Glu He Leu Leu Pro Asn Gly His Asp Gin Met 

210 215 220 

Pro . Leu Gin Gin Asn He Phe Glu Val Met Asp Lys Leu Arg Glu lie 
225 230 235 ^ 240 

Tyr Pro Gin Arg Lys Phe Val Met Ser Arg Phe Glu Glu Val Phe Glu 

245 250 255 

Lys He Glu Ala Gin Arg Asp Asn Leu Ala Thr Leu Lys Gly Glu Phe 

260 265 270 

He Asp Gly Lys Tyr Met Arg Val His Arg Thr He Gly Ser Thr Arg 

2 "75 280 285 

Met Asp lie Lys He Ala His Ala Arg lie Glu Asn Lys lie Val Asn 

290 295 300 

Leu Leu Glu Pro Leu Ala Thr Leu Ala Trp Thr Leu Gly Phe Glu Tyr 
305 310 315 320 

His His Gly Leu Leu Glu Lys Met Trp Lys Glu lie Leu Lys Asn His 

325 330 335 

Ala His Asp Ser lie Gly Cys Cys Cys Ser Asp Lys Val His Arg Glu 

34 0 34 5 * 350 

lie Val Ala A.rg Phe Glu Leu Ala Glu. Asp Met Ala Asp Asn Leu lie 
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355 360 365 

Arg Phe Tyr Met Arg Lys lie Ala Asp Asn Met Pro Gin Ser Asp Ala 

370 375 380 

Asp Lys Leu Val Leu Phe Asn Leu Met Pro Trp Pro Arg Glu Glu Val 
385 390 395 400 

lie Asn Thr Thr Val Arg Leu Arg Ala Ser Gin Phe Asn Leu Arg Asp 

405 410 415 

Asp Arg Gly Gin Pro Val Pro Tyr Phe He Arg His Ala Arg Glu He 

420 425 430' 

Asp Pro Gly Leu He Asp Arg Gin He Val His Tyr Gly Asn Tyr Asp 

4 35 4 40 44 5 

Pro Phe Met Glu Phe Asp He Gin lie Asn Gin lie Val Pro Ser Met 

450 , 455 460 

Gly Tyr Arg Thr Leu Tyr He Glu Ala Asn Gin Pro Gly Asn Val He 
465 470 475 ' 480 

Ala Ala Lys Ser Asp Ala Glu Gly He Leu Glu Asn Ala Phe Trp Gin 

485 490 495 

lie Ala Leu Asn Glu Asp Gly Ser Leu Gin Leu Val Asp Lys Asp Ser 

5 °0 505 510 

Gly Val Arg Tyr Asp Arg Val Leu Gin He Glu Glu Ser Ser Asp Asp 

515 520 525 

Gly Asp Glu Tyr Asp Tyr Ser Pro Ala Lys Glu Glu Trp Val lie Thr 

530 535 540 

Ala Ala Asn Ala Lys Pro Gin Cys Asp lie lie His Glu Ala Trp Gin 
545 550 555 560 

Ser Arg Ala Val lie Arg Tyr Asp Met Ala Val Pro Leu Asn Leu Ser 

565 -570 575 

Glu Arg Ser Ala Arg Gin Ser Thr Gly Arg Val Gly Val Val Leu Val 

580 585 590 

Val Thr Leu Ser His Asn Ser Arg Arg lie Asp Val Asp lie Asn Leu 

595 600 605 

Asp Asn Gin Ala Asp- Asp His Arg Leu Arg Val Leu Val Pro Thr Pro 

610 615 620 

Phe Asn Thr Asp Ser Val Leu Ala Asp Thr Gin Phe Gly Ser Leu Thr 
625 630 635 ' 640 

Arg Pro Val Asn Asp Ser Ala Met Asn Asn Trp Gin Gin Glu Gly Trp 

645 650 655 

Lys Glu Ala Pro Val Pro Val Trp Asn Met Leu Asn Tyr Val Ala Leu 

660 665 670 

Gin Glu Gly Arg Asn Gly Met Ala Val Phe Ser Glu Gly Leu Arg Glu 

675 680 685 

Phe Glu Val He Gly Glu Glu Lys Lys Thr Phe Ala lie Thr Leu Leu 

690 695 700 

Arg Gly Val Gly Leu Leu Gly Lys Gl.u Asp- Leu Leu Leu Arg Pro Gly 
705 710 715 720 

Arg Pro Ser Gly lie Lys Met Pro Val Pro Asp Ser Gin Leu Arg Gly 

725 730 735 

Leu Leu Ser Cys Arg Leu Ser Leu Leu Ser Tyr Thr Gly Thr Pro Thr 

740 745 750 

Ala Ala Gly Val Ala Gin Gin Ala Arg Ala Trp Leu Thr Pro Val Gin 

755 760 765 

Cys Tyr Asn Lys He. Pro Trp Asp Val Met Lys Leu Asn Lys Ala Glv 

770 775 ~ 780 

Phe Asn Val Pro Glu Ser Tyr Ser Leu Leu Lys Met Pro Pro Val Gly 
785 790 795 800 

Cys Leu lie Ser Ala Leu Lys Lys Ala Glu Asp Arg Gin Glu Val lie 

605 810 815 

Leu Arg Leu Phe Asn Pro Ala Glu Ser Ala Thr Cys Asp Ala Thr Val 

820 825 830 

Ala Phe Ser Arg Glu Val lie Ser Cys Ser Glu Thr Met Met Asp Glu 
835 840 845 
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His lie Thr Thr Glu Glu Asn Gin Gly Ser Asn Leu Ser Gly Pro Phe 

•8 50 85 5 8 60 

Leu Pro Gly Gin Ser Arg Thr Phe Ser Tyr Arg Leu Ala 

865 870 875 



<210> 354 

<211> 523 

<212> PRT 

<213> E . Coli 

<400> 354 . 

Met Met Leu Asp lie Val Glu Leu Ser Arg Leu Gin Phe Ala Leu Thr 

1 5 - 10 15 

Ala Met Tyr His Phe Leu Phe Val Pro Leu Thr Leu Gly Met Ala Phe 

20 25 30 

Leu Leu Ala lie Met Glu Thr Val Tyr Val Leu Ser Gly Lys Gin lie 

35 40 45 

Tyr Lys Asp Met Thr Lys Phe Trp Gly Lys Leu Phe Gly lie Asn Phe 

50 55 60 

Ala Leu Gly Val Ala Thr Gly Leu Thr Met Glu Phe Gin Phe Gly Thr 
65 70 75 80 

Asn Trp Ser Tyr Tyr Ser His Tyr Val Gly Asp lie Phe Gly Ala Pro 

85 90 .95 

Leu Ala lie Glu Gly Leu Met Ala Phe Phe Leu Glu Ser Thr Phe Val 

100 105 no 

Gly Leu Phe Phe Phe Gly Trp Asp Arg Leu Gly Lys Val Gin His Met 

115 120 ' 125 

Cys' Val Thr Trp Leu Val Ala Leu Gly Ser Asn Leu Ser Ala Leu Trp 

130 135 140 

lie Leu Val Ala Asn Gly Trp Met Gin Asn Pro lie Ala Ser Asp Phe 
145 ,'150 . 155 160 

Asn Phe Glu Thr Met Arg Met Glu Met Val Ser Phe Ser Glu Leu Val 

165 170 175 

Leu Asn Pro Val Ala Gin Val Lys Phe Val His Thr Val Ala Ser Gly 

180 185 190 

Tyr Val Thr Gly Ala Met Phe lie Leu Gly lie Ser Ala Trp Tyr Met 

195 200 . 205 ' 

Leu Lys Gly -Arg Asp Phe Ala Phe Ala Lys Arg Ser Phe Ala lie Ala 

210 215 220 

Ala Ser Phe Gly Met Ala Ala Val Leu Ser Val lie Val Leu Gly Asp 
225 230 235 240 

Glu Ser Gly Tyr Glu Met Gly Asp Val Gin Lys Thr Lys Leu Ala Ala 

245 250 255 

lie Glu Ala Glu Trp Glu Thr Gin Pro Ala Pro Ala Ala Phe Thr Leu 

260 265 270 

Phe Gly lie Pro Asp Gin Glu Glu Glu Thr Asn Lys Phe Ala lie Gin 

275 280 285 

lie Pro Tyr Ala Leu Gly lie lie Ala thr Arg Ser Val Asp Thr Pro 

290 295 300 

Val lie Gly Leu Lys Glu Leu Met Val Gin His Glu Glu Arg lie Arg 
305 310 315 " 320 

Asn Gly Met Lys Ala Tyr Ser Leu Leu Glu Gin Leu Arg Ser Gly Ser 

325 330 335 

Thr Asp Gin Ala Val Arg Asp Gin Phe Asn Ser Met Lys Lys Asp Leu 

340 345 350 

Gly Tyr Gly Leu Leu Leu Lys Arg Tyr Thr Pro Asn Val Ala Asp Ala 

355 360 365 

Thr Glu Ala Gin lie Gin Gin Ala Thr Lys Asp Ser lie Pro Arg Val 

370 375 380 

Ala Pro Leu Tyr Phe Ala Phe Arg lie Met Val Ala Cys Gly" Phe Leu 
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385 390 395 400 

Leu Leu Ala lie lie Ala Leu Ser Phe Trp Ser Val lie Arg Asn Arg 

405 410 415 

lie Gly Glu Lys Lys Trp Leu Leu Arg Ala Ala Leu Tyr Gly lie Pro 

420 425 430 

Leu Pro Trp He Ala Val Glu Ala Gly Trp Phe Val Ala Glu Tyr Gly 

4 35 4 40 4 45 

Arg Gin Pro Trp Ala He Gly Glu Val Leu Pro Thr Ala Val Ala Asn 

450 455 460 

Ser Ser Leu Thr Ala Gly Asp Leu lie' Phe Ser Met Val Leu He Cys 
465 ^ 470 475 480 

Gly Leu Tyr Thr Leu Phe Leu Val Ala Glu Leu Phe Leu Met Phe Lys 

485 490 495 

Phe Ala Arg Leu Gly Pro Ser Ser Leu Lvs Thr Gly Arg Tyr His Phe 
500. 505 510 

' Glu Gin Ser Ser Thr Thr Thr Gin Pro Ala Arg 
515 520 



<210> 355 
<211> 379 
<212> PRT 
<213> E . Coli 

<400> 355 

Met He Asp Tyr Glu Val Leu Arg Phe He Trp Trp Leu Leu Val Gly 

1 5 10 15 

Val Leu Leu He Gly Phe Ala Val Thr Asp Gly Phe Asp Met Gly Val 

20 25 30 

Gly Met Leu Thr Arg Phe Leu Gly Arg Asn Asp Thr Glu Arg Arg He 

35 4 0 45 

Met He Asn Ser lie Ala Pro His Trp Asp Gly Asn Gin Val Trp Leu 

50 55 60 

He Thr Ala Gly Gly Ala Leu Phe Ala Ala. Trp Pro Met Val Tyr Ala 
65 "70 75 80 

Ala Ala Phe Ser Gly Phe Tyr Val Ala Met He Leu Val Leu Ala Ser 

85 90 95 

Leu Phe Phe Arg Pro Val Gly Phe Asp Tyr Arg Ser Lys He Glu Glu 

100 105 ~ HO 

Thr Arg Trp. Arg Asn Met Trp Asp Trp Gly He Phe He Gly Ser Phe 

115 120 125 

Val Pro Pro Leu Val He Gly Val Ala Phe Gly Asn Leu Leu Gin Gly 

130 135 140 

Val Pro Phe Asn Val Asp Glu Tyr Leu Arg Leu Tyr Tyr Thr Gly Asn 
145 150 155 ' 160 

Phe Phe Gin Leu Leu Asn Pro Phe Gly Leu Leu Ala Gly Val Val Ser 

165 170 175 

Val Gly Met He He Thr Gin Gly Ala Thr Tyr Leu Gin Met Arg Thr 

180 185 ~ 190 

Val Gly Glu Leu His Leu Arg Thr Arg Ala Thr Ala Gin Val Ala Ala 

195 200 205 

Leu Val Thr Leu Val Cys Phe Ala Leu Ala Gly Val Trp Val Met Tyr 

210 215 ' 220 

Gly lie Asp Gly Tyr Val Val Lys Ser Thr Met Asp His Tyr Ala Ala 
225 230 235 240 

Ser Asn Pro Leu Asn Lys Glu Val Val Arg Glu Ala Gly Ala Trp Leu 

245 250 " 255 

Val Asn Phe Asn Asn Thr Pro He Leu Trp Ala He Pro Ala Leu Gly 

260 265 270 

Val Val Leu Pro Leu Leu Thr He Leu Thr Ala Arg Met Asp Lys Ala 
275 280 285 



-175- 



BNSDOCID: <WO 0044906A2_I_> 



WO 00/44906 PCT/US00/02200 



Ala Trp Ala Phe Val Phe Ser Ser Leu Thr Leu Ala Cys lie lie Leu 

290 295 300 

Thr Ala Gly lie Ala Met Phe Pro Phe Val Met Pro Ser Ser Thr Met 

305 310 315 320 

Met Asn Ala Ser Leu Thr Met Trp Asp Ala Thr Ser Ser Gin Leu Thr 

325 330 335 

Leu Asn Val Met Thr Trp Val Ala Val Val Leu Val Pro lie lie Leu 

340 345 350 

Leu Tyr Thr Ala Trp Cys Tyr Trp Lys Met Phe Gly Arg He Thr Lys 

355 360 365 

Glu Asp He Glu Arg Asn Thr His Ser Leu Tyr 
370 375 



<210> 356 
<211> 456 
<212> PRT 
<213> E. Coli 



<400> 356 



Met 


Glu 


Leu 


Ser 


Ser 


Leu 


Thr 


Ala 


Val 


Ser 


Pro 


Val 


Asp 


Gl v 


Arg 


x yi 


1 








5 










10 






15 


Gly 


Asp 


Lys 


Val 


Ser 


Ala 


Leu 


Arg 


Gly 


He 


Phe 


Ser 


Glu 


T vr 


Gly 










20 










25 










30 




Leu 


Lys 


Phe 


Arg 


Val 


Gin 


Val 


Glu 


Val 


Arq 


Trp 


Leu 


Gin 


Lys 


Le u 


Ala 






35 










40 










45 






Ala 


His 


Ala 


Ala 


lie 


Lys 


Glu 


Val 


Pro 


Ala 


Phe 


Ala 


Ala 


Asp 


Ala 


X le 




50 










55 










60 








Gly 


Tyr 


Leu 


Asp 


Ala 


He 


Val 


Ala 


Ser 


Phe 


Ser 


Glu 


Glu 


Asp 


Ala 


Ala 


65 










70 










75 








80 


Arg 


He 


Lys 


Thr 


He 


Glu 


Arg 


Thr 


Thr 


Asn 


His 


Asp 


Val 


Lys 


Ala 


Val 










85, 










90 








95 




Glu 


Tyr 


Phe 


Leu 
100 


Lys 


Glu 


Lys 


Val 


Ala 
105 


Glu 


He 


Pro 


Glu 


Leu 
110 


His 


Ala 


Val 


Ser 


Glu 
115 


Phe 


He 


His 


Phe 


Ala 
120 


Cys 


Thr 


Ser 


Glu 


Asp 
125 


lie 


Asn 


Asn 


Leu . 


Ser 
130 


His 


Ala 


Leu 


Met 


Leu 
135 


Lys 


Thr 


Ala 


Arg 


Asp 
140 


Glu 


Val 


lie 


Leu 


Pro 


Tyr 


Trp 


Arg 


Gin 


Leu 


He 


Asp 


Gly 


lie 


Lys 


Asp 


Leu 


Ala 


Val 


Gin 


145 










150 










155 










160 


Tyr 


Arg 


Asp 


He 


Pro 
165 


Leu 


Leu 


Ser 


Arg 


Thr 
170 


His 


Gly 


Gin 


Pro 


Ala 
175 


Thr 


Pro 


Ser 


Thr 


He. 
180 


Gly 


Lys 


Glu 


Met 


Ala 
185 


Asn 


Val 


Ala 


Tyr 


Arg 
190 


Met 


Glu 


Arg 


Gin 


Tyr 


Arg 


Gin 


Leu 


Asn 


Gin 


Val 


Glu 


He 


Leu 


Gly 


Lys 


lie 


Asn 






195 










200 










205 






Gly 


Ala 


Val 


Gly 


Asn 


Tyr 


Asn 


Ala 


His 


He 


Ala 


Ala 


Tyr 


Pro 


Glu 


Val 




210 










215 










220 








Asp 


Trp 


His 


Gin 


Phe 


Ser 


Glu 


Glu 


Phe 


Val 


Thr 


Ser 


Leu 


Gly 


lie 


Gin 


225 










230 










235 








2 4 0. 


Trp 


Asn 


Pro 


Tyr 


Thr 


Thr 


Gin 


He 


Glu 


Pro 


His Asp 


Tyr 


lie 


Ala 


Glu 










245 










250 










255 




Leu 


Phe 


Asp 


Cys 


Val 


Ala 


Arg 


Phe 


Asn 


Thr 


He 


Leu 


lie 


Asp 


Phe 


Asp 








260 










265 










270 




Arg 


Asp 


Val 


Trp 


Gly 


Tyr 


He 


Ala 


Leu 


Asn 


His 


Phe 


Lys 


Gin 


Lys 


Thr 






275 










280 










285 






He 


Ala 


Gly 


Glu 


He 


Gly 


Ser 


Ser 


Thr 


Met 


Pro 


His 


Lys 


Val 


Asn 


Pro 




290 










295 










300 








He 


Asp 


Phe 


Glu 


Asn 


Ser 


Glu 


Gly 


Asn 


Leu 


Gly 


Leu 


Ser 


Asn 


Ala 


Val 


305 










310 










315 










320 


Leu 


Gin 


His 


Leu 


Ala 


Ser 


Lys 


Leu 


Pro 


Val 


Ser 


Arg 


Trp 


Gin 


Arg 


Asp 
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325 330 335 

Leu Thr Asp Ser Thr Val Leu Arg Asn Leu Gly Val Gly He Gly Tyr 

340 " 345 350 

Ala Leu He Ala Tyr Gin Ser Thr Leu Lys Gly Val Ser Lys Leu Glu 

355 360 365 

Val Asn Arg Asp His Leu Leu Asp Glu Leu Asp His Asn Trp Glu Val 

370 375 380 

Leu Ala Glu Pro He Gin Thr Val Met Arg Arg Tyr Gly He Glu Lys 
385 390 395 ' 400 

Pro Tyr Glu Lys Leu Lys Glu Leu Thr Arg Gly Lys Arg- Val Asp Ala 

4 05 4 10 415 

Glu Gly Met Lys Gin Phe He Asp Gly Leu Ala Leu Pro Glu Glu Glu 

420 425 430 

Lys Ala Arg Leu Lys Ala Met Thr Pro Ala Asn Tyr lie Gly Arg Ala 

4 35 4 40 4 45 

He Thr Met Val Asp Glu Leu Lys 
450 455 

<210> 357 

<211> 61 

<212> PRT 

<213> E . Coli 

<400> 357 

Met Leu He Leu Thr Arg Arg Val Gly Glu Thr Leu Met He Gly Asp 

1 5 10 15 

Glu Val Thr Val Thr Val Leu Gly Val Lys Gly Asn Gin Val Arg lie 

20 25 30 

Gly Val Asn Ala Pro Lys Glu Val Ser Val His Arg Glu Glu He Tyr 

35 40 45 

Gin Arg He Gin . Ala, Glu Lys Ser Gin Gin Ser Ser Tyr 
50 55 60 

<210> 358 

<211> 83 

<212> RNA 

<213> E. Coli 

<400> 358 

ggugaggugg ccgagaggcu gaaggcgcuc cccugcuaag ggaguaugcg gucaaaagcu 60 
gcauccgggg uucgaauccc cgccucaccg cca 83 



<210> 359 

<211> 200 

<212> PRT 

<213> E. Coli 

<400> 359 
Meu Lys Asn Lys Ala 
' 1 5 
Glu He Glu Ser Leu 
20 

Arg Asn Tyr Cys Leu 
35 

Ser Glu He Cys Arg 
50 

Cys lie Tyr He His 
65 



Asp Asn Lys Lys Arg Asn 
10 

Leu Lys Ala Ala Asn Thr 
25 

Thr Leu Leu Cys Phe He 
40 

Leu Arg lie Ser Asp lie 
55 

Arg Leu Lys Lys Gly Phe 
70 ' 75 



Phe Leu Thr His Ser 
15 

Gly Pro His Ala Ala 
30 

His Gly Phe Arg Ala 
45 

Asp Leu Lys Ala Lys- 
60 

Ser Thr Thr His Pro 
80 
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Leu Leu Asn Lys Glu Val Gin Ala Leu Lys Asn Trp Leu Ser lie Arg 

85 90 ■ ~ 95 

Thr Ser'Tyr Pro His Ala Glu Ser Glu Trp Val Phe Leu Ser Arg Lys 

100 105 no 

Gly Asn Pro Leu Ser Arg Gin Gin Phe Tyr His He He Ser Thr Ser 

115 120 125 

Gly Gly *Asn Ala Gly Leu Ser Leu Glu He His Pro His Met Leu Arg 

130 135 140 

His Ser Cys Gly Phe Ala Leu Ala Asn Met Gly He Asp Thr Arg Leu 
145 150 155 . 160 

He Gin Asp Tyr Leu .Gly His Arg Asn lie Arg His Thr Val Trp Tyr 

165 170 175 

Thr Ala Ser Asn Ala Gly Arg Phe Tyr Gly He Trp Asp Arg Ala Arg 

180 185 . 190 

Gly Arg .Gin' Arg His Ala Val Leu 
195 200 



<210> 360 
<211> 198 
<212> PRT 
<213> E. Coli 

<400> 360 

Met Ser Lys Arg Arg Tyr Leu Thr Gly Lys Glu Val Gin Ala Met Met 

1 5 10 15 

Gin Ala Val Cys Tyr Gly Ala Thr Gly Ala Arg Asp Tyr Cys Leu lie 

20 25 30 

Leu Leu Ala Tyr Arg His Gly Met Arg He Ser Glu Leu Leu Asp Leu 

35 40 4 5 

His Tyr Gin Asp Leu Asp Leu Asn Glu Gly Arg He Asn lie Arg Arg 

50 . : < 55 60 

Leu Lys Asn Gly Phe Ser Thr Val His Pro Leu Arg Phe Asp Glu Airg 
65 70 75 ' 80 

Glu Ala Val Glu Arg Trp Thr Gin Glu Arg Ala Asn Trp Lys Gly Ala 

85 90 95 

Asp Arg Thr Asp Ala He Phe He Ser Arg Arg Gly Ser Arg Leu Ser 

100 105 HO 

Arg Gin Gin Ala Tyr Arg He lie Arg Asp Ala Gly He Glu Ala Gly 

115 120 125 

Thr Val Thr Gin Thr His Pro His Met Leu Arg His Ala Cys Gly Tyr 

130 135 140 

Glu Leu Ala Glu Arg Gly Ala Asp Thr Arg Leu lie Gin Asp Tyr Leu. 
I 45 150 155 160 

Gly His Arg Asn lie Arg His Thr Val Arg Tyr Thr Ala Ser Asn Ala 

!65 , .170 175 

Ala Arg Phe Ala Gly Leu Trp Glu Arg Asn Asn Leu lie Asn Glu Lys 

180 185 190 

Leu Lys Arg Glu Glu Val 
195 



<210> 361 
<211> 182 
<212> PRT 
<213> E. Coli 



<400> 361 

Met Lys lie Lys Thr Leu Ala lie Val Val Leu Ser Ala Leu Ser Leu 
1 5 10 15 
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Ser Ser Thr Ala Ala Leu Ala Ala Ala Thr Thr Val Asn Gly Gly Thr 

20 25 30 

Val His Phe Lys Gly Glu Val Val Asn Ala Ala Cys Ala Val Asp Ala 

35 .AO 45 

Gly Ser Val Asp Gin Thr Val Gin Leu Gly Gin Val Arg Thr Ala Ser 

50 55 60 

Leu Ala Gin Glu Gly Ala Thr Ser Ser Ala Val Gly Phe Asn He Gin 
65 70 75 ' 80 

Leu Asn Asp Cys Asp Thr Asn Val Ala Ser Lys Ala Ala Val Ala Phe 

85 90 95 

Leu Gly Thr Ala He Asp Ala Gly His Thr Asn Val Leu Ala Leu Gin 

100 105 no 

Ser Ser Ala Ala Gly Ser Ala Thr Asn Val Gly Val Gin He Leu Asp 

115 120 125 

Arg Thr Gly Ala Ala Leu Thr Leu Asp Gly Ala Thr Phe Ser Ser Glu 

130 135 140 

Thr Thr Leu Asn Asn Gly Thr Asn Thr He Pro Phe Gin Ala Arq Tyr 
145 150 155 160 

Phe Ala Thr Gly Ala Ala Thr Pro Gly Ala Ala Asn Ala Asp Ala Thr 

165 170 175 

Phe Lys Val Gin Tyr Gin 
180 



<210> 362 

<211> 215 

<212> PRT 

<213> E. Coli 

<400> 362 

Met Leu Leu Met Arg Met Arg Pro Ser Arg Phe Ser He Asn Asn Leu 

1 5, 10 15 

Pro Arg Phe Arg Asp Val He Thr Gly Arg Asp Ala His Pro Cys Ala 

20 25 30 

He Lys He Thr Met Lys Arg Lys Arg Leu Phe Leu Leu Ala Ser Leu 

35 40 45 

Leu Pro Met Phe Ala Leu Ala Gly Asn Lys Trp Asn Thr Thr Leu Pro 

50 55 . 60 

Gly Gly Asn Met Gin Phe Gin Gly Val He lie Ala Glu Thr Cys Arq 
65 7 0 75 so 

He Glu Ala Gly Asp Lys Gin Met Thr Val Asn Met Gly Gin He Ser 

85 90 - 95 

Ser Asn Arg Phe His Ala Val Gly Glu Asp Ser Ala Pro Val Pro Phe 

100 105 no 

Val He His Leu Arg Glu Cys Ser Thr Val Val Ser Glu Arq Val Glv 

115 120 125 

Val Ala Phe His Gly Val Ala Asp Gly Lys Asn Pro Asp Val Leu Ser 

130 135 140 

Val Gly Glu Gly Pro Gly lie Ala Thr Asn lie Gly Val Ala Leu Phe 
145 150 155 ' 160. 

Asp Asp Glu Gly Asn Leu Val Pro lie Asn Arg Pro Pro Ala Asn Trp 

165 170 ' 175 

Lys Arg Leu Tyr Ser Gly Ser Thr Ser Leu His Phe lie Ala Lys Tyr 

180 185 190 

Arg Ala Thr Gly Arg Arg Val Thr Gly Gly lie Ala Asn Ala Gin Ala 

195 200 205 

Trp Phe Ser Leu Thr Tyr Gin 
210 215 
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<210> 363 
<211> 241 
<212> PRT 
.<213> E. Coli 

<400> 363 

Met Ser Asn Lys Asn Val Asn Val Arg Lys Ser Gin Glu lie Thr Phe 

1 5 .10 . 15 

Cys Leu Leu Ala Gly He Leu Met Phe Met Ala Met Met Val Ala Gly 

20 25 30 

Arg Ala Glu Ala Gly Val Ala Leu Gly Ala Thr Arg Val He Tyr Pro 

35 40 45 

Ala Gly Gin Lys Gin Glu Gin Leu Ala Val Thr Asn Asn Asp Glu Asn 

50 55 60 

Ser Thr Tyr Leu He Gin. Ser Trp Val Glu Asn Ala Asp Gly Val Lys 
" 70 75 ~ ' 80 

Asp Gly Arg Phe lie Val Thr Pro Pro Leu Phe Ala Met Lys Gly Lys 

85 90 95 

Lys Glu Asn Thr Leu Arg He Leu Asp Ala Thr Asn Asn Gin Leu Pro 

100 105 ' no' 

Gin Asp Arg Glu Ser Leu Phe Trp Met Asn Val .Lys Ala He Pro Ser 

115 120 125 

Met Asp Lys Ser Lys' Leu Thr Glu Asn Thr Leu Gin Leu Ala lie He 

130 135 140 

Ser Arg He Lys Leu Tyr Tyr Arg" Pro Ala Lys Leu Ala Leu Pro Pro 
145 _ 150 155 160 

Asp Gin Ala Ala Glu Lys Leu Arg Phe Arg Arg Ser Ala Asn Ser Leu 

165 170 175 

Thr Leu He Asn Pro Thr Pro Tyr Tyr Leu Thr Val Thr Glu Leu Asn 

180 185 190 

Ala Gly Thr Arg Val Leu Glu Asn Ala Leu Val Pro Pro Met Gly Glu 

195 •' 200 205 

Ser Thr Val Lys Leu Pro Ser Asp Ala Gly Ser Asn lie Thr Tyr Arg 

210 215 220 

Thr lie Asn Asp Tyr Gly' Ala Leu Thr Pro Lys Met Thr Gly Val Met 
225 230 235 ~ 240 

Glu 



<210> 364 
<211> 878 
<212> PRT 
<213> E. Coli 





<400> 


364 


























Met 


Ser 


Tyr 


Leu 


Asn 


Leu 


Arg 


Leu 


Tyr 


Gin Arg 


Asn 


Thr 


Gin 


Cys 


Leu 


1 








5 










10 










15 




His 


lie 


Arg 


Lys 


His 


Arg 


Leu 


Ala 


Gly 


Phe 


Phe 


Val 


Arg 


Leu 


Val 


Val 








20 










25 








30 






Ala 


Cys 


Ala 


Phe 


Ala 


Ala 


Gin 


Ala 


Pro 


Leu 


Ser 


Ser 


Ala 


Asp 


Leu 


Tyr 






35 










40 










45 




Phe 


Asn 


Pro 


Arg 


Phe 


Leu 


Ala . 


Asp 


Asp 


Pro 


Gin 


Ala 


Val 


Ala 


Asp 


Leu 




50 










55 










60 








Ser 


Arg 


Phe 


Glu 


Asn 


Gly 


Gin 


Glu 


Leu 


Pro 


Pro 


Gly 


Thr 


Tyr 


Arg 


Val 


65 










70 










75 






80 


Asp 


He 


Tyr 


Leu 


Asn 


Asn 


Gly 


Tyr 


Met 


Ala 


Thr 


Arg 


Asp 


Val 


Thr 


Phe 










85 










90 










95 




Asn 


Thr 


Gly 


Asp 


Ser 


Glu 


Gin 


Gly 


He 


Val 


Pro 


Cys 


Leu 


Thr 


Arg 


Ala 








100 










105 










110 




Gin 


Leu 


Ala 


Ser 


Met 


Gly 


Leu 


Asn 


Thr 


Ala 


Ser 


Val 


Ala 


Gly 


Met 


Asn 
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115 120 125 

Leu Leu Ala Asp Asp Ala Cys Val Pro Leu Thr Thr Met Val Gin Asp 

130 135 140 

Ala Thr Ala His Leu Asp Val Gly Gin Gin Arg Leu Asn Leu Thr He 
145 150 155 160 

Pro Gin Ala Phe Met Ser Asn Arg Ala Arg Gly Tyr He Pro Pro Glu 

165 170 ' 175 

Leu Trp Asp Pro Gly He Asn Ala Gly Leu Leu Asn Tyr Asn Phe Ser 
180 185 190 

, Gly Asn Ser Val Gin Asn Arg He Gly Gly Asn Ser His Tyr Ala Tyr 
195 200 205 

Leu Asn Leu Gin Ser Gly Leu Asn He Gly Ala Trp- Arg Leu Arg Asp 

210 215 220 

Asn Thr Thr Trp Ser Tyr Asn Ser Ser Asp Arg Ser Ser Gly Ser Lys 
225 230 235 240 

Asn Lys Trp Gin His lie Asn Thr Trp Leu Glu Arg Asp He He Pro 

24 5 250 ' 255 

Leu Arg Ser Arg Leu Thr Leu Gly Asp Gly Tyr Thr Gin Gly Asp He 

260 265 270 

Phe Asp Gly lie Asn Phe Arg Gly Ala Gin Leu Ala Ser Asp Asp Asn 

275 280 285 

Met Leu Pro Asp Ser Gin Arg Gly Phe Ala Pro Val lie His Gly lie 

2 90 2 95 300 

Ala Arg Gly Thr Ala Gin Val Thr lie Lys Gin Asn Gly Tyr Asp lie 
305 310 315 320 

Tyr Asn Ser Thr Val Pro Pro Gly Pro Phe Thr lie Asn Asp lie Tyr 

325 330 335 

Ala Ala Gly Asn Ser Gly Asp Leu Gin Val Thr lie Lys Glu Ala Asp 

34 0 34 5 350 

Gly Ser Thr Gin lie Phe Thr Val Pro Tyr Ser Ser Val Pro Leu Leu 

355 360 365 

Gin Arg Glu Gly His Thr Arg Tyr Ser lie Thr Ala Gly Glu Tyr Arq 

370 375 380 

Ser Gly Asn Ala Gin Gin Glu Lys Thr Arg Phe Phe Gin Ser Thr Leu 
385 390 395 400 

Leu His Gly Leu Pro Ala Gly Trp Thr lie Tyr Gly Gly Thr Gin Leu 

405 . 410 " 415 

Ala Asp Arg Tyr Arg Ala Phe Asn Phe Gly lie Gly Lys Asn Met Gly 

420 425 430 

Ala Leu Gly Ala Leu Ser Val Asp Met Thr Gin Ala Asn Ser Thr Leu 

435 440 445 

Pro Asp Asp Ser Gin His Asp Gly Gin Ser Val Arg Phe Leu Tyr Asn 

450 455 460 

Lys Ser Leu Asn Glu Ser Gly Thr Asn lie Gin Leu Val Gly Tyr Arg 
465 470 475 480 

Tyr Ser Thr Ser Gly Tyr Phe Asn Phe Ala Asp Thr Thr Tyr Ser Arg 

4 85 .4 90 4 95 

Met Asn Gly Tyr Asn He' Glu Thr Gin Asp Gly Val He Gin Val Lys 

500 505 510 

Pro Lys Phe Thr Asp Tyr Tyr Asn Leu Ala Tyr Asn Lys Arg Gly Lys 

515 520 525 

Leu Gin Leu" Thr Val Thr Gin Gin Leu Gly Arg Thr Ser Thr Leu Tyr 

530 535 540 

Leu Ser Gly Ser His Gin Thr Tyr Trp Gly Thr Ser Asn Val Asp Glu 
545 550 555 560 

Gin Phe Gin Ala Gly Leu Asn Thr Ala Phe Glu . Asp lie Asn Trp Thr 

565 570 575 

Leu Ser Tyr Ser Leu Thr Lys Asn Ala Trp Gin Lys Gly Arg Asp Gin 

580 585 590 

Met Leu Ala Leu Asn Val Asn lie Pro Phe Ser His Trp Leu Arg Ser 
595 600 605 
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Asp Ser Lys Ser Gin Trp Arg His Ala Ser Ala Ser Tyr Ser Met Ser 

610 615 - 620 

His Asp Leu Asn Gly Arg Met Thr Asn Leu Ala.Gly Val Tyr Gly Thr 
625 630 635 64 0 

Leu Leu Glu Asp Asn Asn Leu Ser Tyr Ser Val Gin Thr Gly Tyr Ala 

• 645 650 655 

Gly Gly Gly Asp Gly Asn Ser Gly Ser Thr Gly Tyr Ala Thr Leu Asn 

660 665 670 

Tyr Arg Gly Gly Tyr Gly Asn Ala Asn He Gly Tyr Ser His Ser Asp 

675 680 685 

Asp lie Lys Gin Leu Tyr Tyr Gly Val Ser Gly Gly Val Leu Ala His 

690 695 700 

Ala Asn Gly Val Thr Leu Gly Gin Pro Leu Asn Asp Thr Val Val Leu 
705 710 715 . 720 

Val Lys Ala Pro Gly Ala Lys Asp Ala Lys Val Glu Asn Gin Thr Gly 

725 730 735 

Val Arg Thr Asp Trp Arg Gly Tyr Ala Val Leu Pro Tyr Ala Thr Glu 

. 740 745 750 

Tyr Arg Glu Asn Arg Val Ala Leu Asp Thr Asn Thr Leu Ala Asp Asn 

.755 760 765 

Val Asp Leu Asp Asn Ala Val Ala Asn Val Val Pro Thr Arg Gly Ala 

"770 775 780 

He Val Arg Ala Glu Phe Lys Ala Arg Val Gly He Lys Leu Leu Met 
7 85 790 ' 795 " 800 

Thr Leu Thr His Asn Asn Lys Pro Leu Pro Phe Gly Ala Met Val Thr 

805 . 810 815 

Ser Glu Ser Ser Gin Ser Ser Gly He Val Ala Asp Asn Gly Gin Val 

820 825 830 

Tyr Leu Ser Gly Met Pro Leu Ala Gly Lys Val Gin Val Lys Trp Gly 

835 840 845 

Glu. Glu Glu Asn Ala His Cys Val Ala Asn Tyr Gin Leu Pro Pro Glu 

850 855 860 

Ser Gin Gin Gin Leu Leu Thr Gin Leu Ser Ala Glu Cys Arg 
865 870 875 



<210> 365 

<211> 176 

<212> PRT 

<213> E . Coli 

<400> 365 

Met Arg Asn Lys Pro Phe Tyr Leu Leu Cys Ala Phe Leu Trp Leu Ala 

5 10 15 

Val Ser His Ala Leu Ala Ala Asp Ser Thr lie Thr He Arg Gly Tyr 

20 25 30 

Val Arg Asp Asn Gly Cys Ser Val Ala Ala Glu Ser Thr Asn Phe Thr 

35 40 45 

Val Asp Leu Met Glu Asn Ala Ala Lys Gin Phe Asn Asn He Gly Ala 

50 55 60 

Thr Thr Pro Val Val Pro Phe Arg He Leu Leu Ser Pro Cys Gly Asn 
65 70 75 80 

Ala Val Ser Ala Val Lys Val Gly Phe Thr Gly Val Ala Asp Ser His 

85 90 95 

Asn Ala Asn Leu Leu Ala Leu Glu Asn Thr Val Ser Ala Ala Ser Gly 

100 105 110 

Leu Gly He Gin Leu Leu Asn Glu Gin Gin Asn Gin He Pro Leu Asn 

115 120 125 

Ala Pro Ser Ser Ala Leu Ser Trp Thr Thr Leu Thr Pro Gly Lys Pro 

130. 135 140 

Asn Thr Leu Asn Phe Tyr Ala Arg Leu Met Ala Thr Gin Val Pro Val 
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145 150 155 160 

Thr Ala Gly His He Asn Ala Thr Ala Thr Phe Thr Leu Glu Tyr Gin 
165 170 • 175 



<210> 366 
<211>. 167 
<212> PRT 
<213> E. Coli 

<400> 366 

Met Lys Trp Cys Lys Arg Gly Tyr Val Leu Ala Ala He Leu Ala Leu 
' 1 5 10 15 

Ala Ser Ala Thr He Gin Ala Ala Asp Val Thr He Thr Val Asn Gly 

20 25 30 

Lys Val Val Ala Lys Pro Cys Thr Val Ser Thr Thr Asn Ala Thr Val 

35 40 45 

Asp Leu Gly Asp Leu Tyr Ser Phe Ser Leu Met Ser Ala Gly Ala Ala 

50 55 60 

Ser Ala Trp His Asp Val Ala Leu Glu Leu Thr Asn Cys Pro Val Gly 
65 7 0 75 80 

Thr Ser Arg Val Thr Ala Ser Phe Ser Gly Ala Ala Asp Ser Thr Gly 

85 90 95 

Tyr Tyr Lys Asn Gin Gly Thr Ala Gin Asn He Gin Leu Glu Leu Gin 

100 105 no 

Asp Asp Ser Gly Asn Thr Leu Asn Thr Gly Ala Thr Lys Thr Val Gin 

115 120 125 

Val Asp Asp Ser Ser Gin Ser Ala His Phe Pro Leu Gin Val Arg Ala 

130 135 140 

Leu Thr Val Asn Gly Gly Ala Thr Gin Gly Thr He Gin Ala Val He 
145 150 155 160 

Ser He Thr Tyr Thr .Tyr Ser 
165 



<210> 367 
<211> 300 
<212> PRT 
<213> E . Coli 

<400> 367 

Met Lys Arg Val He Thr Leu Phe Ala Val Leu Leu Met Gly Trp Ser 

1 5 10 15 

Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala He 

20 25 30 

Pro He Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Val 

35 40 45 

Val Asn Val Gly Gin Asn Leu Val Val Asp Leu Ser Thr Gin lie Phe 

50 5 5 60 

Cys His Asn Asp Tyr Pro Glu Thr lie Thr Asp Tyr Val Thr Leu Gin 
65 70 75 " 80 

Arg Gly Ser Ala Tyr Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val 

85 90 95, 

Lys Tyr Ser Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro 

100 105 no 

Arg Val Val Tyr Asn Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu 

115 120 125 

Tyr Leu Thr Pro Val Ser Ser Ala Gly Gly Val Ala lie Lys Ala Gly 
130 135' " 140 
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<212> PRT • 
<213> E. Coli 
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Val He Arg Arg Asp Gin Glu Ser Ala Leu Thr Cys Trp Gly He Glu 
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<210> 372 
<211> 71 
<212> PRT 
<213> E. Coli 

<400> 372 

Met Ser Asn Lys Met Thr Gly Leu Val Lys Trp Phe Asn Ala Asp Lys 

1 5 10 "15 

Gly Phe Gly Phe lie Ser Pro Val Asp Gly Ser Lys Asp Val Phe Val 
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20 

His Phe Ser Ala lie Gin 
35" 

Gin Lys Val Thr Phe Ser 
50 

Ala Asn Val lie lie Thr 
65 70 



25 

Asn Asp Asn Tyr Arg Thr 
4 0 

lie Glu Ser Gly Ala Lys 
55 60 
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4 5 
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<210> 373 
<211> 338 
<212> PRT 
<213> E. Coli 

<400> 373 

Met Phe Val He Trp Ser His Arg Thr Gly Phe He Met Ser His Gin 

15" 10 15 

Leu Thr Phe Ala Asp Ser Glu Phe Ser Ser Lys Arg Arg Gin Thr Arq 

20 25 " ~ 30 

Lys Glu He Phe Leu Ser Arg Met Glu Gin He Leu Pro Trp Gin Asn 

35 40 45 

Met Val Glu Val He Glu Pro Phe Tyr Pro Lys Ala Gly Asn Glv Ara 

50 55 60 

Arg Pro Tyr Pro Leu Glu Thr Met Leu Arg He His Cys Met Gin His 
65 70 75 " 80 

Trp Tyr Asn Leu Ser Asp Gly Ala Met . Glu Asp Ala Leu Tyr Glu He 

85 90 95 

Ala Ser Met Arg Leu Phe Ala Arg Leu Ser Leu Asp Ser Ala Leu Pro 

100 105 no 

Asp Arg Thr Thr He Met Asn Phe Arg His Leu Leu Glu Gin His Gin 

115 120 125 

Leu Ala Arg Gin Leu Phe Lys Thr He Asn Arg Trp Leu Ala Glu Ala 

130 135 140 

Gly Val Met Met Thr Gin Gly Thr Leu Val Asp Ala Thr He He Glu 
145 150 155 160 

Ala Pro Ser Ser Thr Lys Asn Lys Glu Gin Gin Arg Asp Pro Glu Met 

165 170 ~ 175 

His Gin Thr Lys Lys Gly Asn Gin Trp His Phe Gly Met Lys Ala His 

• 180 185 ' 190 

He Gly Val Asp Ala Lys Ser Gly Leu Thr His Ser Leu Val Thr Thr 

195 200 205 

Ala Ala Asn Glu His Asp Leu Asn Gin Leu Gly Asn Leu Leu His Gly 

210 215 " 220 

Glu Glu Gin Phe Val Ser Ala Asp Ala Gly Tyr Gin Gly Ala Pro Gin 
225 230 235 " 240 

Arg Glu Glu Leu Ala Glu Val Asp Val Asp Trp Leu He Ala Glu Arg 

245 250 255 

Pro Gly Lys Val Arg Thr Leu Lys Gin His Pro Arg Lys Asn Lys Thr 

260 265 270 

Ala lie Asn He Glu Tyr Met Lys Ala Ser lie Arg' Ala Arg Val Glu 

275 280 285 

His Pro Phe Arg He lie Lys Arg Gin Phe Gly Phe Val Lys Ala Arg 

290 295 300 

Tyr Lys Gly Leu Leu Lys Asn Asp Asn Gin Leu Ala Met Leu Phe Thr 
305 310 • 315 320 

Leu Ala Asn Leu Phe Arg Ala Asp Gin Met lie Arg Gin Trp Glu Arg 
325 330 335 

Ser His 
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<210> 374 

<211> 157 

<212> PRT 

<213> E . Coli 

<400> 374 

Met Val Tyr lie He He Val Ser His Gly His Glu Asp Tyr He Lys 

1 5 .10 15 

Lys Leu Leu Glu Asn Leu Asn Ala Asp Asp Glu His Tyr Lys He He 

20 25 30 

Val Arg Asp Asn Lys Asp Ser Leu Leu Leu Lys Gin He Cys Gin His 

35 -40 45 

Tyr Ala Gly Leu Asp Tyr He Ser Gly Gly Val Tyr Gly Phe Gly His 

50 55 60 

Asn Asn Asn He Ala Val Ala Tyr Val Lys Glu Lys Tyr Arg Pro" Ala 
65 70 75 " 80 

Asp Asp Asp Tyr He Leu Phe Leu Asn Pro Asp He He Met Lys His 

85 90 95 

Asp Asp Leu Leu Thr Tyr He Lys Tyr Val Glu Ser Lys Arg Tyr Ala 

100 105 no 

Phe Ser Thr Leu Cys Leu Phe Arg Asp Glu Ala Lys Ser Leu His Asp 

115 120 125 

Tyr Ser Val Arg Lys ' Phe Pro Val Leu Ser Asp Phe He Val Ser Phe 

130 135 140 

Met Leu Gly He Lys Glu Gly Ala Asn Lys Ser Leu He 
145 150 155 



<210> 375 

<211> 372 

<212> PRT 

<213> E . Coli 

<400> 375 

Met Gly Lys Ser lie Val Val Val Ser Ala Val Asn Phe Thr Thr Gly 

1 5 10 . 15 

Gly Pro Phe Thr lie Leu Lys Lys Phe Leu Ala Ala Thr Asn Asn Lys 

20 25 30 

Glu Asn Val Ser Phe lie Ala Leu Val His Ser Ala Lys Glu Leu Lys 

" 35 40 . 45 

Glu Ser Tyr Pro Trp Val Lys Phe lie Glu Phe Pro Glu Val Lys Gly 

50 55 .60 

Ser Trp Leu Lys Arg Leu His Phe Glu Tyr Val Val Cys Lys Lys Leu 

65 70 75 ' 80 

Ser Lys Glu Leu Asn Ala Thr His Trp lie Cys Leu His Asp lie Thr 

85 90 95 

Ala Asn Val Val Thr Lys Lys Arg Tyr Val Tyr Cys His Asn Pro Ala 

100 105 HO 

Pro Phe Tyr Lys Gly lie Leu Phe Arg Glu He Leu Met Glu Pro Ser 

115 120 125 

Phe Phe Leu -Phe Lys Met Leu Tyr Gly Leu lie Tyr Lys lie Asn He 

130 135 140 

Lys Lys Asn Thr Ala Val Phe Val Gin Gin Phe Trp Met Lys Glu Lys 

I 45 150 155 ^ 160 

Phe lie Lys Lys Tyr Ser lie Asn Asn He lie Val Ser Arg Pro Glu 

165- 170 175 

lie Lys Leu Ser Asp Lys Ser Gin .Leu Thr Asp Asp Asp Ser Gin Phe 

. 180 185 190 

Lys Asn Asn Pro Ser Glu Leu Thr lie Phe Tyr Pro Ala Val Pro Arg 
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195 200 205 

Val Phe Lys Asn Tyr Glu Leu lie lie Ser Ala Ala Arg Lys Leu Lvs 

210 215 220 

Glu Gin Ser Asn lie Lys Phe Leu Leu Thr lie Ser Gly Thr Glu Asn 
225 230 235 " 240 

Ala Tyr Ala Lys Tyr He He Ser Leu Ala Glu Gly Leu Asp Asn Val 

2 ^5 250 255 

His Phe Leu Gly Tyr . Leu Asp Lys Glu Lys He Asp His Cys Tyr Asn 

260 265 1 270 

He Ser Asp He Val Cys Phe Pro Ser Arg Leu Glu Thr Trp Gly Leu 

275 280 285 

Pro Leu Ser Glu Ala Lys Glu Arg Gly Lys Trp Val Leu Ala Ser Asp 

290 295 300 

Phe Pro Phe Thr Arg Glu Thr Leu Gly Ser Tyr Glu Lys Lys Ala Phe 
305 310 315 " " 320 

Phe Asp Ser Asn Asn Asp Asp Met Leu Val Lys Leu He He Asp Phe 

325 330 ^ 335 

Lys Lys Gly Asn Leu Lys Lys Asp He Ser Asp Ala Asn Phe He Tyr 

340 345 350 

Arg Asn Glu Asn Val Leu Val Gly Phe Asp Glu Leu Val Asn Phe He 

355 360 365 

Thr Glu Glu His 
370 



<210> 376 

<211> 196 

<212> PRT 

<213> £. Coli 

<400> 376 

Met He Leu Lys Leia Ala Lys Arg Tyr Gly Leu Cys Gly Phe He Arg 

1 5 10 15 

Leu Val Arg Asp Val Leu Leu Thr Arg Val Phe Tyr Arg Asn Cys Arg 

20 25 30 

He He Arg Phe Pro Cys Tyr He Arg Asn Asp Gly Ser He Asn Phe 

• 35 40 45 

Gly Glu Asn Phe Thr Ser Gly Val Gly Leu Arg Leu Asp Ala Phe Glv 

50 55 60 

Arg Gly Val He Phe Phe Ser Asp Asn Val Gin Val Asn Asp Tyr Val 
65 70 75 80 

His He Ala Ser He. Glu Ser Val Thr He Gly Arg Asp Thr Leu He 

85 90 95 

Ala Ser Lys Val Phe He Thr Asp His Asn His Gly Ser Phe Lys His 

100 105 no 

Ser Asp Pro Met Ser Ser Pro Asn He Pro Pro Asp Met Arg Thr Leu. 

115 120 125 

Glu Ser Ser Ala Val Val He Gly Gin Arg Val Trp Leu Gly Glu Asn 

130 135 140 

Val Thr Val Leu Pro Gly Thr He He Gly Asn Gly Val Val Val Gly 
I 45 150 155 160 

Ala Asn Ser Val Val Arg Gly Ser He Pro Glu Asn Thr Val He Ala 

165 170 175 

Gly Val Pro Ala Lys He He Lys Lys Tyr Asn His Glu Thr Lys Leu 

180 185 190 

Trp Glu Lys Ala 
195 

<210> 377 
<211> 330 
<212> PRT 
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<213> E. Coli 
<400> 377 

Met Tyr Phe Leu Asn Asp Leu Asn Phe Ser Arg Arg Asp Ala Gly Phe 

1 ; 5 . 10 15 

Lys Ala Arg Lys Asp Ala Leu Asp lie Ala Ser Asp Tyr Glu Asn lie 

.20 25 30 

Ser Val Val Asn lie Pro- Leu Trp Gly Gly Val Val Gin Arg lie lie 

35 40 45 

Ser Ser Val Lys Leu. Ser Thr Phe Leu Cys Gly Leu Glu Asn Lys Asp 

50 5 5 * 60 

Val Leu lie Phe Asn Phe Pro Met Ala Lys Pro Phe Trp His lie Leu 
65 70 75 80 

Ser Phe Phe His Arg Leu Leu Lys Phe Arg lie Val Pro Leu lie His 

85 90 95 

Asp He Asp Glu Leu Arg Gly Gly Gly Gly Ser Asp Ser Val Arg Leu 

100 105 HO 

Ala Thr Cys Asp Met Val He Ser His Asn Pro Gin Met Thr Lys Tyr 

115 120 125 

Leu Ser Lys Tyr Met Ser Gin Asp Lys He Lys Asp He Lys He Phe 

130 135 140 

Asp Tyr Leu Val Ser Ser Asp Val Glu His Arg Asp Val Thr Asp Lys 
145 150 155 * 160 

Gin Arg Gly Val He Tyr Ala Gly Asn Leu Ser Arg His Lys Cys Ser 

165 170 175 

Phe lie Tyr Thr Glu Gly Cys Asp Phe Thr Leu Phe Gly Val Asn Tyr 

180 185 190 

Glu Asn Lys Asp Asn Pro Lys Tyr Leu Gly Ser Phe Asp Ala Gin Ser 

195 200 205 

Pro Glu Lys lie Asn Leu Pro Gly Met Gin Phe Gly Leu He Trp Asp 

210 215 220 

Gly Asp Ser Val Glu' : Thr Cys Ser Gly Ala Phe Gly Asp Tyr Leu Lys 
225 230 235 240 

Phe Asn Asn Pro His Lys Thr Ser Leu Tyr Leu Ser Met Glu Leu Pro 

245 250 255 

Val Phe He Trp Asp Lys Ala Ala Leu Ala Asp Phe He Val Asp Asn 

260 265 * 270 

Arg He Gly Tyr Ala Val Gly Ser He Lys Glu Met Gin Glu He Val 

275 280 285 

Asp Ser Met Thr He Glu Thr Tyr Lys Gin lie Ser Glu Asn Thr Lys 

290 295 300 

lie lie Ser Gin Lys lie Arg Thr Gly Ser Tyr Phe Arg Asp Val Leu 
305 310 315 * 320 

Glu Glu Val lie Asp Asp Leu Lys Thr Arg 
325 330 



<210>' 378 
<211> 388 
<212> PRT 



<400> 378 
Met lie Tyr Leu Val lie Ser Val 

1 5 
Leu Tyr Leu Lys Lys Asp lie Phe 
2 0 

He Phe Ala Leu Val Leu Leu Gly 

35 40 
Ala Phe Gin Leu Asn Asd Ala Thr 
50 55 



Phe Leu lie Thr Ala Phe lie Cys 

10 * 15 

Tyr Pro Ala Val Cys. Val Asn lie 
25 30 
Tyr Glu He Thr Ser Asp He Tyr 
45 

Leu lie Phe Leu Leu Cys Asn Val 
60 
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Leu Thr Phe Thr Leu Ser Cys Leu Leu Thr Glu Ser Val Leu Asp Leu 
65 70 75 80 

Asn He Arg Lys Val Asn Asn Ala He Tyr Ser He Pro Ser Lys Lys 

8 5 90 95 

Val His Asn Val Gly Leu Leu Val He . Ser Phe Ser Met He Tyr He 

100 105 no 

Cys Met- Arg Leu Ser Asn Tyr Gin Phe Gly Thr Ser Leu Leu Ser Tyr 

115 120 125 

Met Asn Leu He Arg Asp Ala Asp Val Glu Asp Thr Ser Arg Asn Phe 

130 135 i 4 o 

Ser Ala Tyr Met Gin Pro He lie Leu Thr Thr Phe Ala Leu Phe He 
145 "0 155 160 

Trp Ser Lys Lys ■ Phe Thr Asn Thr Lys Val Ser Lys Thr Phe Thr Leu 

16 5 170 175 

Leu Val Phe He Val Phe lie Phe Ala He He Leu Asn Thr Gly Lys 

180 185 190 

Gin He Val Phe Met Val He lie Ser Tyr Ala Phe lie Val Gly Val 

195 200 205 

Asn Arg Val Lys His Tyr Val Tyr Leu lie Thr Ala Val Gly Val Leu 

210 215 220 

Phe Ser Leu Tyr Met Leu Phe Leu Arg Gly Leu Pro Gly Gly Met Ala 
225 230 235 240 

Tyr Tyr Leu Ser Met Tyr Leu Val Ser Pro He lie Ala Phe Gin Glu 

245 ■ 250 255 

Phe Tyr Phe Gin Gin Val Ser Asn Ser Ala Ser Ser His Val Phe Trp 

260 265 270 

Phe Phe Glu Arg Leu Met Gly Leu Leu Thr Gly Gly Val Ser Met Ser 

275 280 285 

Leu His Lys Glu Phe Val Trp Val Gly Leu Pro Thr Asn Val Tyr Thr 

290 295 300 

Ala Phe Ser Asp Tyr Val Tyr lie Ser Ala Glu Leu Ser Tyr Leu Met 
305 ,310 315 ~ 320 

Met Val lie His Gly Cys lie Ser Gly Val Leu Trp Arg Leu Ser Arg 

325 330 335 

Asn Tyr lie Ser Val Lys lie Phe Tyr Ser Tyr Phe lie Tyr Thr Phe 

340 345 350 

Ser Phe lie Phe Tyr His Glu Ser Phe Met Thr Asn lie Ser Ser Trp 

355 360 365 

lie Gin lie Thr Leu Cys lie lie Val Phe Ser Gin Phe Leu Lys Ala 

370 375 380 

Gin Lys lie Lys 
385 



<210> 379 
<211> 367 
<212> PRT 
<213> E. Coli 



<400> 379 

Met Tyr Asp Tyr lie lie Val Gly Ser Gly Leu Phe Gly Ala Val Cys 
- 1 5 10 15 

Ala Asn Glu Leu Lys Lys Leu Asn Lys Lys Val Leu Val lie Glu Lvs 

20 25 30 

Arg Asn His lie Gly Gly Asn Ala Tyr Thr Glu Asp Cys Glu Gly lie 

35 4 0 * 45 

Gin lie His Lys Tyr Gly Ala His lie Phe His Thr Asn Asp Lys Tyr 

50 55 60 

lie Trp Asp Tyr Val Asn Asp Leu Val Glu Phe Asn Arg Phe Thr Asn 
65 70 75 " 80 
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Ser Pro Leu Ala lie Tyr Lys Asp Lys Leu Phe Asn Leu Pro Phe Asn 

85 90 95 

Met Asn Thr Phe His Gin Met Trp Gly Val Lys Asp Pro Gin Glu Ala 

100 105 HO 

Gin Asn lie lie Asn Ala Gin Lys Lys Lys Tyr Gly Asp Lys Val Pro 

115 120 " " 125 

Glu Asn Leu Glu Glu Gin Ala lie Ser Leu Val Gly Glu Asp Leu Tyr 

130 135 140 

Gin Ala Leu lie Lys Gly Tyr Thr Glu Lys Gin Trp Gly Arg Ser Ala 
145 150 155 ~ 160 

Lys Glu Leu Pro Ala Phe lie lie Lys Arg He Pro Val Arg Phe Thr 

165 170 175 

Phe Asp Asn Asn Tyr Phe Ser Asp Arg Tyr Gin Gly He Pro Val Gly 

180 185 ~ 190 

Gly Tyr Thr Lys Leu He Glu Lys Met Leu Glu Gly Val Asp Val Lys 

195 200 205 

Leu Gly He Asp Phe Leu Lys Asp Lys Asp Ser Leu Ala Ser Lys Ala 

210 215 220 

His Arg He lie Tyr Thr Gly Pro He Asp Gin Tyr Phe Asp Tyr Arg 
225 230 235 240 

Phe Gly Ala Leu Glu Tyr Arg Ser Leu Lys Phe Glu Thr Glu Arg His 

245 250 255 

Glu Phe Pro Asn Phe Gin Gly Asn Ala Val He Asn Phe Thr Asp Ala 

260 265 270 

Asn Val Pro Tyr Thr Arg He He Glu His Lys His Phe Asp Tyr Val 

275 280 285 

Glu Thr Lys His Thr Val Val Thr Lys Glu Tyr Pro Leu Glu Trp Lys 

290 295 300 

Val Gly Asp Glu Pro Tyr Tyr Pro Val Asn Asp Asn Lys Asn Met Glu' 
305 310 315 320 

Leu Phe Lys Lys Tyr Arg Glu Leu Ala Ser Arg Glu Asp Lys Val He 

325' 330 335 

Phe Gly Gly Arg Leu Ala Glu Tyr Lys Tyr Tyr Asp Met His Gin Val 

340 345 * 350 

He Ser Ala Ala Leu Tyr Gin Val Lys Asn lie Met Ser Thr Asp 
355 360 365 



<210> 380 
<211> 371 
<212> PRT 
<213> E . Coli 

<400> 380 

Met Phe Pro Lys He Met Asn Asp Glu Asri Phe Phe Lys Lys Ala Ala 

1 5 10 15 

Ala His Gly Glu Glu Pro Pro Leu Thr Pro Gin Asn Glu His Gin Arg 

20 25 30 

Ser Gly Leu Arg Phe Ala Arg Arg Val Arg Leu Pro Arg Ala Val Gly 

35 40 45 

Leu Ala Gly Met Phe Leu Pro He Ala Ser Thr Leu Val Ser His Pro 

50 55 60 

Pro Pro Gly Trp Trp Trp Leu Val Leu Val Gly Trp Ala Phe Val Trp 
65 7 0 75 8 0 

Pro His Leu Ala Trp Gin He Ala Ser Arg Ala Val Asp Pro Leu Ser 

8 5 90 95 

Arg Glu lie Tyr Asn Leu Lys Thr Asp Ala Val Leu Ala Gly Met Trp 

100 % 105 110 

Val Gly Val Met Gly Val Asn Val Leu Pro Ser Thr Ala Met Leu Met 

115 120 125 

lie Met Cys Leu Asn Leu Met Gly Ala Gly Gly Pro Arg Leu Phe Val 
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130 135 140 

Ala Gly Leu Val Leu Met Val Val Ser Cys Leu Val Thr Leu Glu Leu 
145 150 155 160 

Thr Gly lie Thr Val Ser Phe Asn Ser Ala Pro Leu Glu Trp Trp Leu 

165 170 175 

Ser Leu Pro lie lie Val lie Tyr Pro Leu Leu Phe Gly Trp Val Ser 

180 185 190 

Tyr Gin Thr Ala Thr Lys Leu Ala Glu His Lys Arg Arg Leu Gin Val 

195 200 " 205 

Met" Ser Thr Arg Asp Gly Met Thr Gly Val Tyr Asn Arg Arg His Trp 

210 215 220 

Glu Thr Met Leu Arg Asn Glu Phe Asp Asn Cys Arg Arg His Asn Arg 
225 230 235 " " 240 

Asp Ala Thr Leu Leu lie lie Asp lie Asp His Phe Lys Ser lie Asn 

245 250 255 

Asp Thr Trp Gly His Asp Val Gly Asp Glu Ala lie Val Ala Leu Thr 

260 265 270 

Arg Gin Leu Gin lie Thr Leu Arg Gly Ser Asp Val lie Gly Arg Phe 

275 280 ' 285 

Gly Gly Asp Glu Phe Ala Val He Met Ser Gly Thr Pro Ala Glu Ser 

290 295 ' 300 

Ala He Thr Ala Met Leu Arg Val His Glu Gly Leu Asn Thr Leu Arg 
305 310 315 320 

Leu Pro Asn Thr Pro Gin Val Thr Leu Arg He Ser Val Gly Val Ala 

325 . 330 335 

Pro Leu Asn Pro Gin Met Ser His Tyr Arg Glu Trp Leu Lys Ser Ala 

340 345 350 

Asp Leu Ala Leu Tyr Lys Ala Lys Lys Ala Gly Arg Asn Arg Thr Glu 

355 ' 360 365 

Val Ala Ala 
370 

<210> 381 

<211> 467 

<212> PRT 

<213> E. Coli 

<400> 381 

Met Asp Val Asn Val Asp Gin Phe Asp Thr Glu Ala Phe Arg Thr Asp 

1 5 10 15 

Lys Leu Glu Leu Thr Ser Gly Asn He Ala Asp His Asn Gly Asn Val 

20 25 30 

Val Ser Gly Val Phe Asp He His Ser Ser Asp Tyr Val Leu Asn Ala 

35 .4 0 4 5 

Asp Leu Val Asn Asp Arg Thr Trp Asp Thr Ser Lys Ser Asn Tyr Gly 

50 55 60 

Tyr Gly He Val Ala Met Asn Ser Asp Gly His Leu Thr He Asn Gly 
65 70 75 80 

Asn Gly Asp Val Asp Asn Gly Thr Glu Leu Asp Asn Ser Ser Val Asp 

85 90 95 

Asn Val Val Ala Ala Thr Gly Asn Tyr Lys Val Arg He Asp Asn Ala 

100 105 HO 

Thr Gly Ala Gly Ala lie Ala Asp Tyr Lys Asp Lys Glu He He Tyr 

115 120 125 " 

Val Asn Asp Val Asn Ser Asn Ala Thr Phe Ser Ala Ala Asn Lys Ala 

130 135 140 

Asp Leu Gly Ala Tyr Thr Tyr Gin Ala Glu Gin Arg Gly Asn Thr Val 
145 150 155 " 160 

Val 'Leu Gin Gin Met Glu Leu Thr Asp Tyr Ala Asn Met Ala Leu Ser 

165 170 175 

He Pro Ser Ala Asn Thr Asn He Trp Asn Leu Glu Gin Asp Thr Val 
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180 










185 










190 






.Gly 


Thr 


Arg 


Leu 


Thr 


Asn 


Ser 


Arg 


His 


Gly 


Leu 


Ala 


Asp 


Asn 


Gly 


Gly 






195 










200 










205 






Ala 


Trp 


Val 


Ser 


Tyr 


Phe 


Gly 


Gly 


Asn 


Phe 


Asn 


Gly 


•Asp 


Asn 


Gly 


Thr 




210 










215 










220 








He 


Asn 


Tyr 


Asp 


Gin 


Asp 


Val 


Asn 


Gly 


He 


Met 


Val 


Gly 


Val 


Asp 


Thr 


225 










230 










235 










240 


Lys 


lie 


Asp 


Gly 


Asn 


Asn 


Ala 


Lys 


Trp 


He 


Val 


Gly 


Ala 


Ala 


Ala 


Gly 










245 










250 










255 




Phe 


Ala 


Lys 


Gly 


Asp 


Met 


Asn 


Asp 


Arg 


Ser 


Gly 


Gin 


Val 


Asp 


Gin 


Asp 








260 










265 










270 




Ser 


Gin 


Thr 


Ala 


Tyr 


He 


Tyr 


Ser 


Ser 


Ala 


His 


Phe 


Ala 


Asn 


Asn 


Val 






275 










280 










285 








Phe 


Val 


Asp 


Gly 


Ser 


Leu 


Ser 


Tyr 


Ser 


His 


Phe 


Asn 


Asn 


Asp 


Leu 


Ser 




290 










295 










300 










Ala 


Thr 


Met 


Ser 


Asn 


Gly 


Thr 


Tyr 


Val 


Asp 


Gly 


Ser 


Thr 


Asn 


Ser 


Asp 


305 










310 










315 










320 


Ala 


Trp 


Gly 


Phe 


Gly 


Leu 


Lys 


Ala 


Gly 


Tyr 


Asp 


Phe 


Lys 


Leu 


Gly 


Asp 










325 










330 










335 




Ala 


Gly 


Tyr 


Val 


Thr 


Pro 


Tyr 


Gly 


Ser 


Val 


Ser 


Gly 


Leu 


Phe 


Gin 


Ser 








340 










345 










350 






Gly 


Asp 


Asp 


Tyr 


Gin 


Leu 


Ser 


Asn 


Asp 


Met 


Lys 


Val 


Asp 


Gly 


Gin 


Ser 






355 










360 










365 








Tyr 


Asp 


Ser 


Met 


Arg 


Tyr 


Glu 


Leu 


Gly 


Val 


Asp 


Ala 


Gl v 


Tvr 


Thr 


Phe 




370 










375 










380 










Thr 


Tyr 


Ser 


Glu 


Asp 


Gin 


Ala 


Leu 


Thr 


Pro 


Tyr 


Phe 


Lvs 


Leu 


Ala 


Tvr 


385 










390 










395 










400 


Val 


Tyr 


Asp 


Asp 


Ser 


Asn 


Asn 


Asp 


Asn 


A.sp 


Val 


Asn 


Gly 


Aso 


Ser 


lie 










'405 










410 










415 




Asp Asn 


Gly 


Thr 


Glu 


Gly 


Ser 


Ala 


Val 


Arg 


Val 


Gly 


Leu 


Gly 


Thr 


Gin 








420 










425 










4 30 






Phe 


Ser 


Phe 


Thr 


Lys/ 


Asn 


Phe 


Ser 


Ala 


Tyr 


Thr 


Asp 


Ala 


Asn 


Tyr 


Leu 






435 










440 










445 








Gly 


Gly 


Gly 


Asp 


Val 


Asp 


Gin 


Asp 


Trp 


Ser 


Ala 


Asn 


Val 


Gly 


Val 


Lys 



450 455 460 



Tyr Thr Trp 

465 •■ 



<210> 382 
<211> 222 
<212> PRT 
<213> E. Coli 

<400> 382 



Met 


Pro 


Val 


Lys 


Asp 


Leu 


Thr 


Gly 


lie 


Thr 


Ala 


Lys 


Asp 


Ala 


Gin 


Met 


1 








5 










10. 










15 




Leu 


Ser 


Val 


Val 


Lys 


Pro 


Leu 


Gin 


Glu 


Phe 


Gly 


Lys 


Leu 


Asp 


Lys 


Cys 








20 










25 










30 






Leu 


Ser 


Arg 


Tyr 


Gly 


Thr 


Arg 


Phe 


Glu 


Phe 


Asn 


Asn 


Glu 


Lys 


Gin 


Val 






35 










40 










45 








He 


Phe 


Ser 


Ser 


Asp 


Val 


Asn 


Asn 


Glu 


Asp 


Thr 


Phe 


Val 


lie 


Leu 


Glu 




50 










55 










60 










Gly Val 


He 


Ser 


Leu 


Arg 


Arg 


Glu 


Glu 


Asn 


Val 


Leu 


lie 


Gly 


lie 


Thr 


65 










70 










75 










80 


Gin 


Ala 


Pro 


Tyr 


He 


Met 


Gly 


Leu 


Ala 


Asp 


Gly 


Leu 


Met 


Lys 


Asn 


Asp 










85 










90 










95 




lie 


Pro 


Tyr 


Lys 


Leu 


He 


Ser 


Glu 


Gly 


Asn 


Cys 


Thr 


Gly 


Tyr 


His 


Leu 








100 










105 










110 






Pro 


Ala 


Lys 


Gin 


Thr 


lie 


Thr 


Leu 


lie 


Glu 


Gin 


Asn 


Gin 


Leu 


Trp 


Arg 






115 










120 










125 
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Asp Ala 


Pne 


Tyr 


Trp 


Leu 


Ala 


Trp 


Gin 




130 










135 






Asp 


val 


Gin 


Leu 


lie 


Gly 


His 


Asn 


Ser 


14 5 










150 








Leu 


Leu 


Ser 


Met 


lie 


Asp 


Trp 


Asn 


Glu 










165 










Val 


Met 


Asn 


Tyr 


lie 


His 


Gin 


Arg 


Thr 








180 










185 


Ala 


Glu 


Val 


Leu 


Ala 


Ala 


Leu 


Arg 


Lys 






195 










200 


Lys 


Gly 


Lys 


Leu 


Val 


Ala 


He 


Asn 


Arg 




210 










215 






PCT/USOO/02200 



Asn Arg He Leu Glu Leu Arg 
14 0 

Tyr Glu Gin He Arg Ala .Thr 
155 160 
Glu Leu Arg Ser Arg He Gly 
170 175 
Arg He Ser Arg Ser Val Val 
190 

Gly Gly Tyr He Glu Met Asn 
205 

Leu Pro Ser Glu Tyr 
220 



<210> 383 
<211> 84 
<212> PRT 
<213> E. Coli 

<400> 383 

Met Thr Asp Lys He Arg Thr Leu Gin Gly Arg Val Val Ser Asp Lys 

1 5 10 15 

Met Glu Lys Ser He Val Val Ala lie Glu Arg Phe Val Lys His Pro 

20 25 30 

lie Tyr Gly Lys Phe lie Lys Arg Thr Thr Lys Leu His Val His Asp 

35 40 45 

Glu Asn Asn Glu Cys Gly lie Gly Asp Val Val Glu lie Arg Glu Cys 

50 55 60 

Arg Pro Leu Ser Lys Thr Lys Ser Trp Thr Leu Val Arg Val Val Glu 
65 70 75 80 

Lys Ala Val Leu 



<210> 384 

<211> 63 

<212> PRT 

<213> E . Coli 

<400> 384 

Met Lys Ala Lys Glu Leu Arg Glu Lys Ser Val Glu Glu Leu Asn Thr 

.1 5 10 .15 

Glu Leu Leu Asn Leu Leu Arg Glu Gin Phe Asn Leu Arg Met Gin Ala 

20 25 .30 

Ala Ser Gly Gin Leu Gin Gin Ser His Leu Leu Lys Gin Val Arg Arg 

35 ' 40 45 

Asp Val Ala Arg Val Lys Thr Leu Leu Asn Glu Lys Ala Gly Ala 

50 55 60 



<210> 385 
<211> 136 
<212> PRT 
<213> E. Coli 



<400> 385 

Met Leu Gin Pro Lys Arg Thr Lys Phe Arg Lys Met His Lys" Gly Arg 

15 10 15 

Asn Arg Gly Leu Ala Gin Gly Thr Asp Val Ser Phe Gly Ser Phe Gly 

20 25. 30 

Leu Lys Ala Val Gly Arg Gly Arg Leu Thr Ala Arg Gin lie Glu Ala 
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35 40 45 

Ala Arg Arg Ala Met Thr Arg Ala Val Lys Arg Gin Gly Lys lie Trp 

50 55 60 

lie Arg Val Phe Pro Asp Lys Pro lie Thr Glu Lys Pro Leu Ala Val 
65 70 75 80 

Arg Met Gly Lys Gly Lys Gly Asn Val Glu Tyr Trp Val Ala Leu lie 

85 90 95 

Gin Pro Gly Lys Val Leu Tyr Glu Met Asp Gly Val Pro. Glu Glu Leu 

100 105 no 

Ala Arg Glu Ala Phe Lys Leu Ala Ala Ala Lys Leu Pro lie Lys Thr 

115 120 125 

Thr Phe Val Thr Lys Thr Val Met 
130 135 



<210> 386 
<211> 233 
<212> PRT 
<213> E. Coli 

<400> 386 

Met Gly Gin Lys Val His Pro Asn Gly lie Arg Leu Gly He Val Lys 

1 5 10 15 

Pro Trp Asn Ser Thr Trp Phe Ala Asn Thr Lys Glu Phe Ala Asp Asn 

20 25 " 30 

Leu Asp Ser Asp Phe Lys Val Arg Gin Tyr Leu Thr Lys Glu Leu Ala 

35 40 45 

Lys Ala Ser Val Ser Arg He Val He Glu Arg Pro Ala Lys Ser He 

50 55 60 

Arg Val Thr He His Thr Ala Arg Pro Gly He Val He Gly Lys Lys 
65 , 7 0 75 80 

Gly Glu Asp Val Glu Lys Leu Arg Lys Val Val Ala Asp He Ala Gly 

85 90 * 95 

Val Pro Ala Gin He Asn He Ala Glu Val Arg Lys Pro Glu Leu Asp 

100 105 " no 

Ala Lys Leu Val Ala Asp Ser He Thr Ser Gin Leu Glu Arg Arg Val 

H5 120 125 

Met Phe Arg Arg Ala Met Lys Arg Ala Val Gin Asn Ala Met Arg Leu 

130 135 140 

Gly Ala Lys Gly lie Lys Val Glu Val Ser Gly Arg Leu Gly Gly Ala 
14 5 .150. 155 * ^ 160 

Glu He Ala Arg Thr Glu Trp Tyr Arg Glu Gly Arg Val Pro Leu His 

165 170 175 

Thr Leu Arg Ala Asp- He. Asp Tyr Asn Thr Ser Glu Ala His Thr Thr 

180 185- 190 

Tyr Gly Val He Gly Val Lys Val Trp He Phe Lys Gly Glu He Leu 

195 200 205 

Gly Gly Met Ala Ala Val Glu Gin Pro Glu Lys Pro Ala Ala Gin Pro 

210 215 220 

Lys Lys Gin Gin Arg Lys Gly Arg Lys 
225 230 



<210> 387 

<211> 110 

<212> PRT 

<213> E: Coli 

<400> 387 
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Met 


Glu 


Thr 


He 


Ala 


Lvs 


His 


Arg 


His 


Ala 
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Ser 
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Lys 


1 








5 










10 










15 


Val 


Arq 


Leu 


Val 


Ala 


Asd 


Leu 


He 


Arg 


Gly 


Lys 


i-j y 


VOX 




r,i n 
o x n 


-rt.x a 








20 










25 
















Leu 


Asp 


He. 


Leu 


Thr 


Tvr 


Thr 


Asn 


Lys 


Lys 


Ala 


Ala 


Val 




v d x 


Lys 






35 










40 










4 5 






Lys 


Val 


Leu 


Glu 


Ser 


Ala 


He 


Ala 


Asn 


Ala 


Glu 


His 


L c n 




vj x y 


nld 




50 • 










55 










60 






Asp 


lie 


Asp 


Asp 


Leu 


Lvs 
j 


Val 


Thr 


Lys 


He 


Phe 


Val 


Asp 


Glu 


Gly 


t X O 


65 










70 










75 






80 


Ser 


Met 


Lys 


Arg 


He 


Met 


Pro 


Arg 


Ala 


Lys 


Gly 


Arg 


Ala 


Asp 


Arg 


He 










85 










90 










95 




Leu 


Lys 


Arg 


Thr 


Ser 


His 


He 


Thr 


Val 


Val 


Val 


Ser 


Asp 


Arg 












100 










105 








110 







<210> 388 

<211> 92 

<212> PRT 

<213> E . Coli 

<400> 388 
Met Pro Arg Ser Leu Lys Lys Gly 

1 5 
Met Lys Val Glu Lys Ala Val Glu 
20 

Thr Trp Ser Arg Arg Ser Thr He 

35 • 40 

He Ala Val His Asn Gly Arg Gin 

50 55 
Glu Met Val Gly His Lys Leu Gly 
65 70 
Arg Gly His Ala Ala Asp Lys Lys 
85 



Pro Phe He Asp Leu His Leu Leu 

10 15 
Ser Gly Asp Lys Lys Pro Leu Arg 
25 30 
Phe Pro Asn Met He Gly Leu Thr 
45 

His Val Pro Val Phe Val Thr Asp 
60 

Glu Phe Ala Pro Thr Arg Thr Tyr 

75 80 
Ala Lys Lys Lys 
90 



<210> 389 
<211> 273 
<212> PRT 
<213> E. Coli 





<400> 
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Pro 
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Arg 
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Val 
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Val 


Lys 


Val 


Val 
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Pro 


Glu 
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His 
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Gly 


Lys 


Pro 
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Ala 


Pro 
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Glu 
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Ser 
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Gly 


Gly 


Arg 
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Asn 
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Gly 


Arg 






35 










40 










45 
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Thr 


Arg 


His 


He 


Gly 


Gly 


Gly 


His 


Lys 
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He 




50 
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Val 


Asp 
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Val 
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Glu 


Arg 
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70 
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Glu 


Arg 
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Gly 


Asp 
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He 
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Gly 
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Pro 
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Val 


Gin 


lie 


Val 
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Gly 


Glu 
180 


Met 


Ara 


Lvs 


Val 


Glu 
185 


Ala 


Asp 


Cy s 




A1 » 
190 


i nr 


Le u 


Gly" 


Glu 


Val 


Gly 


Asn 


Ala 


Glu 


His 


Met 


Leu 


Arg 


Val 


Le u 


Gly 


Lys 


7\ "j -j 
J-\X a 






195 










200 










205 




Gly 


Ala 


Ala 


Arg 


Trp 


Arg 


Gly 


Val 


Ara 


Pro 


Thr 


Val 


Arg 


Gly 


Thr 


A 1 ^ 




210 










215 










220 








Met 


Asn 


Pro 


Val 


Asp 


His 


Pro 


His 


Gly 


Gly 


Gly 


Glu 


Gly 


Arg 


Asn 


Phe 


225 










230 










235 








240 


Gly 


Lys 


His 


Pro 


Val 


Thr 


Pro 


Trp 


Gly 


Val 


Gin 


Thr 


Lys 


Gly 


Lys 


Lys 










245 










250 










255 


Thr 


Arg 


Ser 


Asn 


Lys 


Arg 


Thr 


Asp 


Lys 


Phe 


He 


Val 


Arg 


Arg 


Arg 


Ser 



260 265 270 



Lys 



<210> 390 
<211> 100 
<212> PRT 
<213> E. Coli 





<400> 


390 










Met 


He 


Arg 


Glu 


Glu 


Arg 


Leu 


Leu 


1 








5 








Ser 


Glu 


Lys 


Ala 
20 


Ser 


Thr 


Ala 


Met 


Lys 


Val 


Ala 
35 


Lys 


Asp 


Ala 


Thr 


Lys 
40 


Lys 


Leu 
50 


Phe 


Glu 


Val 


Glu 


Val 
55 


Glu 


Gly 


Lys 


Val 


Lys 


Arg 


His 


Gly 


Gin 


65 










70 






Lys 


Lys 


Ala 


Tyr 


Val 
85 


Thr 


Leu 


Lys 


Gly 


Gly 


Ala 


Glu 
100 











Lys Val Leu Arg Ala Pro His Val 

10 15 

Glu Lys Ser Asn Thr He Val Leu 

25 ■ 30 

Ala Glu lie Lys Ala Ala Val Gin 
45 

Val Val Asn Thr Leu Val Val Lys 
60 

Arg He Gly Arg Arg Ser Asp Trp 

75 8 0 

Glu Gly Gin Asn Leu Asp Phe Val 
90 95 



<210> 391 
<211> 201 
<212> PRT 
<213> E. Coli 

<400> 391 

Met Glu Leu Val Leu Lys Asp Ala Gin Ser Ala Leu Thr Val Ser Glu 

1 5 10 15 

Thr Thr Phe Gly Arg Asp Phe Asn Glu Ala Leu Val His Gin Val Val 

20 25 30 

Val Ala Tyr Ala Ala Gly Ala Arg Gin Gly Thr Arg Ala Gin Lys Thr 

35 4 0 4 5 

Arg Ala Glu Val Thr Gly Ser Gly Lys Lys Pro Trp Arg Gin Lys Gly 

50 55 60 

Thr Gly Arg Ala Arg Ser Gly Ser He Lys Ser Pro He Trp Arg Ser 
65 70 75 80 
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Gly Gly Vai Thr 

Asn Lys Lys ' Met 
100 

Val Arg Gin Asp 
115 

Pro Lys Thr Lys 
130 

Asp Val Leu lie 
145 

Ala Arg Ash Leu 

Pro Val Ser Leu 
180 

Val Lys Gin Val 
195 



Phe Ala Ala Arg 
85 

Tyr Arg Gly Ala 

Arg Leu lie Val 
120 

Leu Leu Ala Gin 
135 

lie Thr Gly Glu 
150 

His Lys Val Asp 
165 

lie Ala Phe Asp 

Glu Glu Met Leu 
200 



Pro Gin Asp His 
90 

Leu Lys Ser lie 
105 

Val Glu Lys Phe 

Lys Leu Lys Asp 
140 

Leu Asp Glu Asn 
155 

Val Arg Asp Ala 
170 

Lys Val Val Met 

185 

Ala 



Ser Gin Lys Val 
95 

Leu Ser Glu Leu 

• 110 

Ser Val Glu Ala 
125 

Met Ala Leu Glu 

Leu Phe Leu Ala 
160 

Thr Gly lie Asp 
175 

Thr Ala Asp Ala 
190 



<210> 392 
<211> 209 
<212> PRT 
<213> E. Coli 

<400> 392 

Met lie Gly Leu Val Gly Lys Lys Val Gly Met Thr Arg lie Phe Thr 

1 5 .10 15 

Glu Asp Gly Val Ser He Pro Val Thr Val He Glu Val Glu Ala Asn 

20 25 30 

Arg Val Thr Gin Val Lys Asp Leu Ala Asn Asp Gly Tyr Arg Ala lie 

35 40 45 

Gin Val Thr Thr Gly Ala Lys Lys Ala Asn Arg Val Thr Lys Pro Glu 

50 55 60 

Ala Gly His Phe Ala Lys Ala Gly Val Glu Ala Gly Arg Gly Leu Trp 
65 70 75 - - 8q 

Glu Phe Arg Leu Ala Glu Gly Glu Glu Phe Thr Val Gly Gin Ser lie 

8 5 90 ' 95 

Ser Val Glu Leu Phe Ala Asp Val Lys Lys Val Asp Val Thr Gly Thr 

100 105 " HO 

Ser Lys Gly Lys Gly Phe Ala Gly Thr Val Lys Arg Trp Asn Phe Arg 

115 . 120 125 

Thr Gin Asp Ala Thr His Gly Asn Ser Leu Ser His Arg Val Pro Gly 

130 135 140 

Ser He Gly Gin Asn Gin Thr Pro Gly Lys Val Phe Lys Gly Lys Lys 
145 150 155 " 160 

Met Ala Gly Gin Met Gly Asn. Glu Arg Val Thr Val Gin Ser Leu Asp 

165 170 175 

Val Val Arg Val Asp Ala Glu Arg Asn Leu Leu Leu Val Lys Gly Ala 

180 185 190 

Val Pro Gly Ala Thr Gly Ser Asp Leu lie Val Lys Pro Ala Val Lys 
195 200 205 

Ala 



<210> 393 
<211> 103 
<212> PRT 
<213> E . Coli 
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<400> 


393 


























Met 


Gin 


Asn 


Gin 


Arg 


lie 


Arg 


He 


Arg 


Leu 


Lys 


Ala 


Phe 


Asp 


His 


Aro 


1 








5 










10 










15 


Leu 


He 


Asp 


Gin 


Ala 


Thr. 


Ala 


Glu 


He 


Val 


Glu 


Thr 


Ala 


Lys 


Arq 


Thr 








20 










25 










30 




Gly 


Ala 


Gin 


Val 


Arg 


Gly 


Pro 


lie 


Pro 


Leu 


Pro 


Thr 


Arg 


Lys 


Glu 


Arq 






35 










40 










45 






Phe 


Thr 
50 


Val 


Leu 


He 


Ser 


Pro 
55 


His 


Val 


Asn 


Lys 


Asp 
60 


Ala 


Arq 


Asp 


Gin 


Tyr 


Glu 


He 


Arq 


Thr 


His 


Leu 


Arcr 


Leu 


Val 


Asp 


He 


Val 


Glu 


Pro 


Thr 


65 










70 










75 










80 


Glu 


Lys 


Thr 


Val 


Asp 
85 


Ala 


Leu 


Met 


Arg 


Leu 
90 


Asp 


Leu 


Ala 


Ala 


Gly 
95 


Val 


Asp 


Val 


Gin 


He 
100 


Ser 


Leu 


Gly 





















<210> 394 
<211> 118 
<212> PRT 
<213> E. Coli 





<400> 


394 


























Met 


Ala 


Arg 


Val 


Lys 


Arg 


Gly 


Val 


lie 


Ala 


Arg 


Ala 


Arg 


His 


Lys 


Lys 


1 








5 










10 










15 


lie 


Leu 


Lys 


Gin 


Ala 


Lys 


Gly 


Tyr 


Tyr 


Gly Ala 


Arg 


Ser 


Arg 


Val 


Tyr 








20 










25 










30 




Arg 


Val 


Ala 


Phe 


Gin 


Ala 


Val 


lie 


Lys 


Ala 


Gly 


Gin 


Tyr 


Ala 


Tyr 


Arg 






35 










40 










45 




Asp 


Arg 


Arg 


Gin 


Arg 


Lys 


Arg 


Gin 


Phe 


Arg 


Gin 


Leu 


Trp 


He 


Ala 


Arg 




50 










55 










60 








He 


Asn 


Ala 


Ala 


Ala 


Arg 


Gin. 


Asn 


Gly 


lie 


Ser 


Tyr 


Ser 


Lys 


Phe 


He 


65 










70 










75 








80 


Asn 


Gly 


Leu 


Lys 


Lys 
85 


Ala 


Ser 


Val 


Glu 


lie 
90 


Asp 


Arg 


Lys 


lie 


Leu 
95 


Ala 


Asp 


lie 


Ala 


Val 


Phe 


Asp 


Lys 


Val 


Ala 


Phe 


Thr 


Ala 


Leu 


Val 


Glu 


Lys 








100 










105 










110 




Ala 


Lys 


Ala 
115 


Ala 


Leu 


Ala 























<210> 395 

<211> 65 

<212> PRT 

<213> E . Coli 

<400> 395 

Met Pro Lys lie Lys Thr Val Arg Gly Ala Ala Lys Arg Phe Lys Lys 

1 5 10 . 15 

Thr Gly Lys Gly Gly Phe Lys His Lys His Ala Asn Leu Arg His lie 

20 - 25 30 

Leu Thr Lys Lys Ala Thr Lys Arg Lys Arg His Leu Arg Pro Lys Ala 

35 4 0 4 5 

Met Val Ser Lys Gly Asp Leu Gly Leu Val lie Ala Cys Leu Pro Tyr 
50 55 60 

Ala - 
65 
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. <210> 396 

<211> 180 

<212> PRT 

<213> E . Coli 

<400> 396 

Met Lys Gly Gly Lys Arg Val Gin Thr Ala Arg Pro Asn Arg lie Asn 

1 5 10 15 

Gly Glu lie Arg Ala Gin Glu Val' Arg Leu Thr Gly Leu Glu Gly Glu 

20 2 5 30 

Gin Leu Gly lie Val Ser Leu Arg Glu Ala Leu Glu Lys Ala Glu Glu 

35 40 45 

Ala Gly Val Asp Leu Val Glu lie Ser Pro Asn Ala Glu Pro Pro Val 

50 55 60 

Cys Arg He Met Asp Tyr Gly Lys Phe Leu Tyr Glu Lys Ser Lys Ser 
65 70 75 ~ 80 

Ser Lys Glu Gin Lys Lys Lys Gin Lys Val He Gin Val Lys Glu - He 

85 90 95 

Lys Phe Arg Pro Gly Thr Asp Glu Gly Asp Tyr Gin Val Lys Leu Arg 

100 105 ~ HO 

Ser Leu He Arg Phe Leu Glu Glu Gly Asp Lys Ala Lys He Thr Leu 

115 120 125 

Arg Phe Arg Gly Arg Glu Met Ala His Gin Gin He Gly Met Glu Val 

130 135 140 

Leu Asn Arg Val Lys Asp Asp Leu Gin Glu Leu Ala Val Val Glu Ser 
145 150. 155 160 

Phe Pro Thr Lys lie Glu Gly Arg Gin Met He Met Val Leu Ala Pro 
165 170 175 

Lys Lys Lys Gin 
180 



<210> 397 
<211> 642 
<212> PRT 
<213> E. Coli 

<400> 397 

Met Pro Val He Thr Leu Pro Asp Gly Ser Gin Arg His Tyr Asp His 

1 5 10 15 

Ala Val Ser Pro Met Asp Val Ala Leu Asp He Gly Pro Gly Leu Ala 

20 25 ^30 

Lys Ala Cys He Ala Gly Arg Val Asn Gly Glu. Leu Val Asp Ala Cys 

35 4 0 4 5 

Asp Leu He Glu Asn Asp Ala Gin Leu Ser lie lie Thr Ala Lys Asp 

50 55 60 

Glu Glu Gly Leu Glu He lie Arg His Ser Cys Ala His Lea Leu Gly 
65 70 75 80 

His Ala lie Lys Gin Leu Trp Pro His Thr Lys Met Ala lie Gly Pro 

8 5 90 95 

Val lie Asp Asn Gly Phe Tyr Tyr Asp Val Asp Leu Asp Arg Thr Leu 

100 105 HO 

Thr Gin Glu Asp Val Glu Ala Leu Glu Lys Arg Met His Glu Leu Ala 

115 120 125 

Glu Lys Asn Tyr Asp Val He Lys Lys Lys Val Ser Trp His Glu Ala 

130 135 " 140 

Arg Glu Thr Phe Ala Asn Arg Gly Glu Ser Tyr Lys Val Ser lie Leu 
145 150 155 160 

Asp Glu Asn lie Ala His Asp Asp Lys Pro Gly Leu Tyr Phe His Glu 
165 170 175 
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Glu Tyr Val Asp Met Cys Arg Gly Pro His Val Pro Asn Met Arg Phe 

180 185 190 

Cys His His Phe Lys Leu 'Met Lys Thr Ala Gly Ala Tyr Trp Arg Gly 

195 200 205 

Asp Ser Asn Asn Lys Met Leu Gin Arg lie Tyr Gly Thr Ala Trp Ala 

210 215 ' 220 

Asp Lys* Lys Ala Leu Asn Ala Tyr Leu Gin Arg Leu Glu Glu Ala Ala 
225 230 235 240 

Lys Arg Asp His Arg Lys lie Gly Lys Gin Leu Asp Leu Tyr His Met 

245 250 255 

Gin Glu Glu Ala Pro Gly Met Val Phe Trp His Asn Asp Gly Trp Thr 

260 265 270. 

lie Phe Arg Glu Leu Glu Val Phe Val Arg Ser Lys Leu Lys Glu Tyr 

275 280 " 285 

Gin Tyr Gin Glu Val Lys Gly Pro Phe Met Met Asp Arg Val Leu Trp 

290 295 300 

Glu Lys Thr Gly His Trp Asp Asn Tyr Lys Asp Ala Met Phe Thr Thr 
305 310 315 320 

Ser Ser Glu Asn Arg Glu Tyr Cys lie Lys Pro Met Asn Cys Pro Gly 

325 330 335 

His Val Gin lie Phe Asn Gin Gly Leu Lys Ser Tyr Arg Asp Leu Pro 

340 345 350 

Leu Arg Met Ala Glu Phe Gly Ser Cys His Arg Asn Glu Pro Ser Gly 

355 360 365 

Ser Leu His Gly Leu Met Arg Val Arg Gly Phe Thr Gin Asp Asp Ala 

370 375 380 

His lie Phe Cys Thr Glu Glu Gin lie Arg Asp Glu Val Asn Gly Cys 
385 390 395 4 00 

lie Arg Leu Val Tyr Asp Met Tyr Ser Thr Phe Gly Phe Glu Lys lie 

405 410 415 

Val Val Lys Leu Ser Thr Arg Pro Glu Lys Arg lie Gly Ser Asp Glu 

420 425 430 

Met Trp Asp Arg Ala Glu Ala Asp Leu Ala Val Ala Leu Glu Glu Asn 

435 440 445 

Asn lie Pro Phe Glu Tyr Gin Leu Gly Glu Gly Ala Phe Tyr Gly Pro 

4 50 4 55 4 60 

Lys lie Glu Phe Thr Leu Tyr Asp Cys Leu Asp Arg Ala Trp Gin Cys 
465 470 475 480 

Gly Thr Val Gin Leu Asp Phe Ser Leu Pro Ser Arg Leu Ser Ala Ser 

485 490 495 

Tyr Val Gly Glu Asp Asn Glu Arg Lys Val Pro Val Met lie His Arg 

500 505 510 

Ala lie Leu Gly Ser Met Glu Arg Phe lie Gly lie Leu Thr Glu Glu 

515 520 525 

Phe Ala Gly Phe Phe Pro Thr Trp Leu Ala Pro Val Gin Val Val lie 

530 535 540 

Met Asn lie Thr Asp Ser Gin Ser Glu Tyr Val Asn Glu Leu Thr Gin 
545 550 555 560 

Lys Leu Ser Asn Ala Gly He Arg Val Lys Ala Asp Leu Arg Asn Glu 

565 570 575 

Lys He Gly Phe Lys He Arg Glu His Thr Leu Arg Arg Val Pro Tyr 

580 585 590 

Met Leu Val Cys Gly Asp Lys Glu Val Glu Ser Gly Lys Val Ala Val 

595 600 605 

Arg Thr Arg Arg Gly Lys Asp Leu Gly Ser Met Asp Val Asn Glu Val 

610 615 620 

He Glu Lys Leu Gin Gin Glu He Arg Ser Arg Ser Leu Lys Gin Leu 
625 630 635 ' 640 

Glu Glu 
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<210> 398 
<211> 450 
<212> PRT 
<213> E . Coli 

<400> 398 

Met Thr Lys His Tyr Asp Tyr lie Ala lie Gly Gly Gly Ser Gly Gly 

1 5 10 ' * 15 

lie Ala Ser lie Asn Arg Ala Ala Met Tyr Gly Gin Lys Cys Ala Leu 

20 25 30 

lie Glu Ala Lys Glu Leu Gly Gly Thr Cys Val Asn Val Gly Cys Val 

35 4 0 4 5 

Pro Lys Lys Val Met Trp His Ala Ala Gin lie Arg Glu Ala lie His 

50 55 60 

Met Tyr Gly Pro Asp Tyr Gly Phe Asp Thr Thr He Asn Lys Phe Asn 
65 70 75 80 

Trp Glu Thr Leu He Ala Ser Arg Thr Ala Tyr He Asp Arg He His 

85 90 95 

Thr Ser Tyr Glu Asn Val Leu Gly Lys Asn Asn Val Asp Val He Lys 

100 105 no 

Gly Phe Ala Arg Phe Val Asp Ala Lys Thr Leu Glu Val Asn Gly Glu 

115 120 125 

Thr lie Thr Ala Asp His He Leu He Ala Thr Gly Gly. Arg Pro Ser 

130 135 140 

His Pro Asp He Pro Gly Val Glu Tyr Gly He Asp .Ser Asp Gly Phe 
145 150 155 160 

Phe Ala Leu Pro Ala Leu Pro Glu Arg Val Ala Val Val Gly Ala Gly 

165 170 175 

Tyr He Ala Val Glu Leu Ala Gly Val He Asn Gly Leu Gly Ala Lys 

180 185 ' 190 

Thr His Leu Phe Val Arg Lys His Ala Pro Leu Arg Ser Phe Asp Pro 

195 200 205 

•Met He Ser Glu Thr Leu Val Glu' Val Met Asn Ala Glu Gly Pro Gin 

210 215 ' 220 

Leu His Thr Asn Ala He Pro Lys Ala Val Val Lys Asn Thr Asp Gly 
225 230 235 240 

Ser Leu Thr Leu Glu Leu Glu Asp Gly Arg Ser Glu Thr Val Asp Cys 

245 250 255 

Leu He Trp Ala He Gly Arg Glu Pro Ala Asn Asp Asn He Asn Leu 

260 265 270 

Glu Ala Ala Gly Val Lys Thr Asn Glu Lys Gly Tyr lie Val Val Asp 

275 280 " 285 

Lys Tyr Gin Asn Thr Asn He Glu Gly He Tyr Ala Val Gly Asp Asn 

290 295 300 

Thr Gly Ala Val Glu Leu Thr Pro Val Ala Val Ala Ala Gly Arg Arg 
305 310 315 320 

Leu Ser Glu Arg Leu Phe Asn Asn Lys Pro Asp Glu His Leu Asp Tyr 

325 330 " 335 

Ser Asn He Pro Thr Val Val Phe Ser His Pro Pro lie Gly Thr Val 

340 345 350 

Gly Leu Thr Glu Pro Gin Ala Arg Glu Gin Tyr Gly Asp Asp Gin Val 

355 360 365 

Lys Val Tyr Lys Ser Ser Phe Thr Ala Met Tyr Thr Ala Val Thr Thr 

370 375 380 

His Arg Gin Pro Cys Arg Met ■ Lys Leu Val Cys Val Gly Ser Glu Glu 
385 - 390 395 400 

Lys lie Val Gly. He His Gly lie Gly Phe Gly Met Asp Glu Met Leu 

405 410 * 415 

Gin Gly Phe Ala Val Ala Leu Lys Met Gly Ala Thr Lys Lys Asp Phe 
420 425 430 
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Asp Asn Thr Val Ala He His Pro Thr Ala Ala Glu Glu Phe Val Thr 
435 440 445 

Met Arg 
450 



<210> 399 
<211> 2894 
<212> RNA 
<213> E. Coli 



<400> 399 

aagguuaagc cucacgguuc auuaguaccg guuagcucaa cgcaucgcug cgcuuacaca 60 

cccggccuau caacgucguc gucuucaacg uuccuucagg acccuuaaag ggucagggag 120 
aacucaucuc ggggcaaguu ucgugcuuag augcuuucag cacuuaucuc uuccgcauuu " 180 

agcuaccggg cagugccauu ggcaugacaa cccgaacacc agugaugcgu ccacuccggu 240 

ccucucguac uaggagcagc cccccucagu ucuccagcgc ccacggcaga uagggaccga 300 

acugucucac gacguucuaa acc.cagcucg cguaccacuu uaaauggcga acagccauac 360 

ccuugggacc uacuucagcc ccaggaugug augagccgac aucgaggugc caaacaccgc 420 

cgucgauaug aacucuuggg cgguaucagc cuguuauccc cggaguaccu uuuauccguu 4 80 

gagcgauggc ccuuccauuc agaaccaccg. gaucacuaug accugcuuuc gcaccugcuc 540 

gcgccgucac gcucgcaguc aagcuggcuu augccauugc acuaaccucc ugauguccga 600 

ccaggauuag ccaaccuucg ugcuccuccg uuacucuuua ggaggagacc gccccaguca 660 

aacuacccac cagacacugu ccgcaacccg gauuacgggu caacguuaga acaucaaaca 720 

uuaaagggug guauuucaag gucggcucca ugcagacugg cguccacacu ucaaagccuc 780 

ccaccuaucc uacacaucaa ggcucaaugu ucagugucaa gcuauaguaa agguucacgg 840 

ggucuuuccg ucuugccgcg gguacacugc aucuucacag cgaguucaau uucacugagu 900 

cucgggugga gacagccugg ccaucauuac gccauucgug caggucggaa cuuacccgac 960 

aaggaauuuc gcuaccuuag gaccguuaua guuacggccg ccguuuaccg gggcuucgau 1020 

caagagcuuc gcuugcgcua accccaucaa uuaaccuucc ggcaccgggc aggcgucaca 1080 

ccguauacgu ccacuuucgu guuugcacag ugcuguguuu uuaauaaaca guugcagcca 114 0 

gcugguaucu ucgacugauu ucagcuccau ccgcgaggga ccucaccuac auaucagcgu 1200 

gccuucuccc gaaguuacgg caccauuuug ccuaguuccu ucacccgagu ucucucaagc 1260 

gccuugguau ucucuaccug accaccugug ucgguuuggg guacgauuug auguuaccug 1320 

augcuuagag gcuuuuccug gaagcagggc auuuguugcu ucagcaccgu agugccucgu 1380 

caucacgccu cagccuugau uuuccggauu ugccuggaaa accagccuac acgcuuaaac 1440 

cgggacaacc gucgcccggc caacauagcc uucuccgucc ccccuucgca guaacaccaa 1500 

guacaggaau auuaaccugu uucccaucga cuacgccuuu cggccucgcc uuaggggucg 1560 

acucacccug ccccgauuaa cguuggacag gaacccuugg ucuuccggcg agcgggcuuu 1620 

ucacccgcuu uaucguuacu uaugucagca uucgcacuuc ugauaccucc agcaugccuc 1680 

acagcacacc uucgcaggcu uacagaacgc. uccccuaccc aacaacgcau aagcgucgcu 174 0 

gccgcagcuu cggugcaugg uuuagccccg auacaucuuc cgcgcaggcc gacucgacca 1800 

gugagcuauu acgcuuucuu uaaaugaugg cugcuucuaa gccaacaucc uggcugucug 1860 

ggccuuccca caucguuucc cacuuaacca ugacuuuggg accuuagcug gcggucuggg 1920 

uuguuucccu cuucacgacg gacguuagca cccgccgugu gucucccgug auaacauucu 1980 

ccgguauucg caguuugcau cggguuggua agucgggaug acccccuugc cgaaacagug 204 0 

cucuaccccc ggagaugaau ucacgaggcg cuaccuaaau agcuuucggg gagaaccagc 2100 

uaucucccgg uuugauuggc cuuucacccc cagccacaag ucauccgcua auuuuucaac 2160 

auuagucggu ucgguccucc aguuaguguu acccaaccuu caaccugccc auggcuagau .2220 

caccggguuu cgggucuaua cccugcaacu uaacgcccag uuaagacucg guuucccuuc 2280 

ggcuccccua uucgguuaac cuugcuacag aauauaaguc gcugacccau uauacaaaag 234 0 

guacgcaguc acacgccuaa gcgugcuccc acugcuugua cguacacggu uucagguucu 2400 

uuuucacucc ccucgccggg guucuuuucg ccuuucccuc acgguacugg uucacuaucg 24 60 

gucagucagg aguauuuagc cuuggaggau ggucccccca uauucagaca ggauaccacg 2520 

ugucccgccc uacucaucga gcucacagca ugugcauuuu uguguacggg gcugucaccc 2580 

uguaucgcgc gccuuuccag acgcuuccac uaacacacac acugauucag gcucugggcu 2640 

gcuccccguu cgcucgccgc uacuggggga aucucgguug auuucuuuuc cucgggguac 2700 

uuagauguuu caguuccccc gguucgccuc auuaaccuau ggauucaguu aaugauagug 2760 

ugucgaaaca cacuggguuu ccccauucgg aaaucgccgg uuauaacggu ucauaucacc 2820 

uuaccgacgc uuaucgcaga uuagcacguc cuucaucgcc ucugacugcc agggcaucca 2880 
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<210> 400 
<211> 120 
<212> RNA 
. <213> E . Coli 



<400> 400 

augccuggca guucccuacu cucgcauggg gagaccccac acuaccaucg gcgcuacggc 60 
guuucacuuc ugaguucggc auggggucag gugggaccac cgcgcuacgg ccgccaggca 120 



<210> 401 

<211> 76' 

<212> RNA 

<213> E. Coli 

<400> 401 

guccccuucg ucuagaggcc caggacaccg cccuuucacg gcgguaacag ggguucgaau 60 
ccccuagggg acgcca " ' " 



<210> 402 
<211> 1549 
<212> RNA 
<213> E. Coli 



<400> 402 

aaauugaaga guuugaucau ggcucagauu gaacgcuggc ggcaggccua acacaugcaa 60 

gucgaacggu aacaggaagc agcuugcugc uucgcugacg aguggcggac gggugaguaa 120 

ugucugggaa gcugccugau ggagggggau aacuacugga aacgguagcu aauaccgcau 180 

aaugucgcaa gaccaaagag ggggaccuuc gggccucuug ccaucggaug ugcccagaug 240 

ggauuagcuu guuggugggg uaacggcuca ccaaggcgac gaucccuagc uggucugaga 300 

ggaugaccag ccacacugga acugagacac gguccagacu ccuacgggag gcagcagugg 360 

ggaauauugc acaaugggcg caagccugau gcagccaugc cgcguguaug aagaaggccu 420 

ucggguugua aaguacuuuc agcggggagg aagggaguaa aguuaauacc uuugcucauu 4 80 

gacguuaccc gcagaagaag caccggcuaa cuccgugcca. gcagccgcgg uaauacggag 54 0 

ggugcaagcg uuaaucggaa uuacugggcg uaaagcgcac gcaggcgggu ugguuaaguc 600 

agaugugaaa uccccgggcu caaccuggga acugcaucug auacuggcaa gcuugagucu 660 

cguagagggg gguagaauuc cagguguagc ggugaaaugc guagagaucu ggaggaauac 720 

cgguggcgaa ggcggccccc uggacgaaga cugacgcuca ggugcgaaag cguggggagc 780 

aaacaggauu agauacccug guaguccacg ■ ccguaaacga ugucgacuug gagguugugc 840 

ccuugaggcg uggcuuccgg agcuaacgcg uuaagucgac cgccugggga guacggccgc 900 

aagguuaaaa cucaaaugaa uugacggggg cccgcacaag cgguggagca ugugguuuaa 960 

uucgaugcaa cgcgaagaac cuuaccuggu cuugacaucc acggaaguuu ucagagauga 1020 

gaaugugccu ucgggaaccg ugagacaggu gcugcauggc ugucgucagc ucguguugug 1080 

aaauguuggg uuaagucccg caacgagcgc aacccuuauc cuuuguugcc agcgguccgg 1140 

ccgggaacuc aaaggagacu gccagugaua aacuggagga agguggggau gacgucaagu 1200 

caucauggcc cuuacgacca gggcuacaca cgugcuacaa uggcgcauac aaagagaagc 1260 

gaccucgcga gagcaagcgg accucauaaa gugcgucgua guccggauug gagucugcaa 1320 

cucgacucca ugaagucgga aucgcuagua aucguggauc agaaugccac ggugaauacg 1380 

uucccgggcc uuguacacac cgcccgucac accaugggag uggguugcaa aagaaguagg 1440 

uagcuuaacc uucgggaggg cgcuuaccac uuugugauuc augacugggg ugaagucgua 1500 

acaagguaac cguaggggaa ccugcgguug gaucaccucc uuaccuuaa 154 9 
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•<210> 403 
<211> 17 
<212> DNA 
<213> Artificial 
<220> 

<223> Primer Oligonucleotide 
<400> 403 

tgtttatcag accgctt 17 

<210> 404 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer Oligonucleotide 
<400> 404 

acaatttcac acagcctc ^8 



<210> 405 
<211> 159 
<212> DNA 

<213> Escherichia coli 
<400> 405 

caggtggtat ggaaacccaa aatggagacg ggaagctgaa ccagatagtt actggaggtg 
atcaccagca gatgaaataa cgataaccag aacaacgcct tatagcgttg agtttgcgag 
aaaacgttca tattgtacct ttttgattaa ccattgggg 

<210> 406 
<211> 640 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc^feature 
<222> (i; . . . (640) 
<223> n = A, T, C or G 



60 
120 
159 



<400> 406 

ggggnccaaa 

aanaaaaggc 

ataaggggac 

accattcggc 

gaccagatr c 

gt cgccca :c 

ctgctggraa 

cacatcaccc 

tttcggcact 

aacgaaatcg 

gccggaacr g 



gtgtttgggn 
ggggatttgt 
aaagngaagg 
cattcgcctt 
gctt cnggca 
atgaagtatt 
tacat cccca 
agtgtctctt 
tcaaagaat c 
ctcggttcca 
catcccggt t 



cgggcaactg 
nagccacggg 
aagtggntat 
cttggttatc 
caaagcccca 
gtcccggagg 
cctgatcctg 
tacgctcgga 
cgctggt cgc 
cgtt tgagta 
gaat cgtcag 



gaggccaacc 
ngattanttt 
taanggannc 
gaagttcat c 
gtaacggctg 
aacaatccag 
cgcaatcggc 
aagacgaatt 
ttccccacca 
ggtgaccggc 
ctcttttgag 



ttaanttngg 
anaataaat t 
gccaatgcga 
cagatagccg 
tccgcgctgt 
gttgccagtt 
actgtcagaa 
ccattttctt 
ttacggcgtg 
agcgcgtttt 



ggaaattttt 
aagtgttgcc 
nttagggcag 
ttgccngacc 
tgtcgcggtt 
gttgccctgg 
tgcggtgcgt 
tggtttcgtt 
agaaggtctg 
cacacgcctg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
640 



<210> 40? 
<211> 682 
<212> DNA 

<213> Escherichia coli 



<220> 

<221> misc feature 
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<222> (1) . . . (682) 
<223> n = A,T,C or G 



<400> 407 

cctgcagggt 

cccagtcggc 

gacgtaagcg 

cgctttgtag 

ccttcgccag 

gcagccgcgg 

caaatatttc 

tggtcaaagt 

ctacagcaca 

atggagatct 

cattgagatc 

ggaaagtatc 



aatgt cgcca 
agcgacaact 
gatggagtgg 
cgtacgcgcg 
cataaaagag 
ttgagtatca 
ctgatagcta 
tgatgttttt 
cgtagaaagc 
tttcttcact 
gctatgaaat 
tggttaagaa 



ttaaactggc 
tgcgttaaag 
ccggaaacct 
gcattggcaa 
gaggatgctc 
ctgaataaag 
tcggtatcag 
tagtctgttg 
acccgacaat 
ggcctaaaaa 
atcaacaact 
gc 



gcaggcagcc 
tcgcaaaatt 
catagtgacc 
taagattcag 
gcgtatgcag 
gatcgttttc 
gaaccaggtc 
tcaaagccgc 
actcctggca 
gctgatattc 
tggaaaatct 



aaagagttgc 
atcatctgca 
gcccaccagt 
atactcagac 
caactgctcc 
gtcaatcaaa 
acgccatgca 
nattataccn 
tgggcgttaa 
tgtaaagagt 
tgnaaagcng 



t ccgct tcta 
ctcact gcgt 
tggcct gcat 
tcttccgggg 
agcgcaaatt 
tgtggctgag 
agtttcgtaa 
gtaaccggca 
agctcacagg 
t acacngtaa 
gttggaaaat 



<210> 408 
<211> 309 
<212> DNA 

<213> Escherichia coii 
<400> 408 

ggggatccgg cagaatttta cgctgaccaa tgacgcgacg acgtggcatg gaaatactcc 
. gttgttaatt caggattgtc caaaactcta cgagtttagt ttgacattta agttaaaacg 
tftggcctta cttaacggag aaccattaag ccttaggacg cttcacgcca tacttggaac 
gagcctgctt acggtcttta acgccggagc agtcaagcgc accacgtacg gtgtggtaac 
gaacacccgg gaggtcttta acacgaccgc cacggatcag gatcacggag tgctcctgca 
gccaagctt ' 

<2~10> 409 
<211> 1167 
<212> DNA 

<213> Escherichia coli 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
•660 
682 



60 
120 
180 
240 
300 
309 



<400> 409 
gtcgacccat 
tggaacactt 
cagaacggac 
cagcagtagt 
ctaaaaagtc 
ttaacttcaa 
, ttgctcacgc 
agacccaggc 
gcttt cagaa 
acagctacag 
tacaacgtcc 
catttaaat k 
taactgcgaw 
gccagccgaa 
cagagttgcc 
tttgacctca 
cgggtgt tec 
gaacggagta 
gccagcttcg 
aatccgcaac 



ctgtccattg 
tccgcaatgc 
gtcctgtcgc 
caaacagtgt 
accagccatc 
cttcagcgcc 
cttctttcag 
cagttgcgcc 
t tacgtcgaa 
cagcagcagc 
attacagaca 
gttcctgaat 
taagcagctt 
gcttctttca 
aggeggt cga 
aattttgeat 
a tagagt atg 
ccttcaacag 
cgaccigctt 
tactgeaage 



ageggacagt 

ctgttgctat 

agtatattaa 

gaaaegctae 

agect gattt 

agcttcttcc 

ageagceggt • 

aegtactget 

ttcagttttt 

ggaaacaccg 

tagctgeaac 

atcagaat aa 

ytttegcat c 

tggttgccat 

tetgagaege 

tcgctttcgc 

caatcagggt 

cacggcgcag 

tacgeagtte 

caagctt 



• ttgtgcaaca 
caegcttaaa 
gtegtcgata 
tggegectta 
ctcaggctgc 
agagcttttt 
gcagattcta 
ttgataacag 
tcttcagcag 
aatttttctt 
tgcttcaatg 
gtttatacgt 
gegtacagma 
caggegtgea 
egggatcage 
gaactctttg 
cggaccaaca 
cagggtgtta 
agtcatt.tta 



ctattttgtt 
ccatttcatt 
gaaacaagca 
cagegcaaaa 
aaceggaagg 
teagtgette 
ccaggtcttt 
caactttgtt 
cttcaaccgg 
yeattgeaga 
awttgatctt 
aagcgaa tgc 
gecagagtae 
attgettett 
tcaccttcaa 
aacagacgag 
aacgegtett 
cgaacaacac 
tctacagtta 



gaeeggaaaa 
gcgatttaca 
ttgaaaggca 
aggctggtga 
gttggcttiat 
tgegtegtet 
agcttctttc 
agcgccagca 
gccagcagct 
gatcaagttc 
tagtgataga 
gttaaaaaga 
gaaccagttt 
egtaggtegg 
aggcagegge 
cagcagcgcc 
t caggcactc 
gcatgtatac 
cgcccacggg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1167 



<210> 410 

<211> 404 

<212> DNA 

<213> Escherichia coli 
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<400> 410 

caacmctatt 

cttaaaccat 

tcgatagaaa 

gccttacagc 

ggctgcaacc 

c.ttttttcag 

attctaccag 



ttgktggacc 
ttcattgcga 
caagcattga 
gcaaaaaggc 
ggaagggttg 
tgcttctgcg 
gtctttagct 



ggaaaakgga 
tttacacaga 
aaggcacagc 
tggtgactaa 
gcttatttaa 
tcgtctttgc 
tctttcagac 



acactttccg 
acggacgtcc 
agtagtcaaa 
aaagtcacca 
cttcaacttc 
tcacgccttc 
ccaggccagt 



cawkgcctgt 
tgtcgcag'ta 
cagtgtgaaa 
gccat cagcc 
agcgccagct 
ttt cagagca 
tgcg 



tgctat cacg 
tattaagtcg 
cgctactggc 
tgatttctca 
tcttccagag 
gccggtgcag 



<210> 411 . 
<211> 152 
<212> DNA 

<213> Escherichia coli 



<400> 411 

agagcttttt tcagtgcttc tgcgtcgtct ttgctcacgc cttctttcaa gagcagcccg 

gtgcagattc taccaggtct ttagcttctt tcagacccag gccagttgcg ccacgtactg 

ctttgataac agcaactttg ttagcgccaq ca 



:ttg 

<210> 412 
<211> 825 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc_f eature 
<222> (1) . . . (825) 
<223> n = A, T, C or G 



<400> 412 

gatccgtcga 

gaaaatggaa 

ttacacagaa 

aggcacagca 

ggtgactaaa 

cttatttaac 

cgtctttgct 

tctttcagac 

ccagcagctt 

agcagctaca 

tcaagttcta 

gtgatagaca 

taaaaagata 

accagtttgc 



cccatctgtc 
cactttccgc 
cggacgt'cct 
gtagtcaaac 
aagtcaccag 
ttcaacttca 
cacgccttct 
ccaggccagt 
tcagaattac 
gctacagcag 
caacgtccat 
tttaaattgt 
actgcgatta 
cagccgaagg 



cattgagcgg 
aatgcctgtt 
gtcgcagtat 
agtgtgaaac 
ccatcagcct 
gcgccagctt 
tt cagagcag 
tgcgccacgt 
gtcgaattca 
cagcagcgga 
tacagacata 
tcctgaatat 
agcagcttct 
ttggcttttc 



acagtttgtg 
gctat cacgc 
attaagtcgt 
gctactggcg 
gatttctcag 
cttccagagc 
ccgggtgcag 
actgctttga 
agttttttct 
aacaccgaat 
gctgcaactg 
cagaataagt 
ttcgcatcgc 
agcctnnncn 



caacactatt 
ttaamccatt 
cgatagaaac 
ccttacagcg 
gctgcaaccg 
ttttttcagt 
attctaccag 
taacagcaac 
tcagcagctt 
ttttcttyca 
cttcaatgat 
ttatacgtaa 
gtacagcagc 
natta 



ttgttgaccg 
tcattgcgat 
aagcattgaa 
caaaaaggct 
gaagggttgg 
gcttctgcgt 
gtctttagct 
tttgttagcg 
caaccgggcc 
ttggcagaga 
t kgatcttwa 
gcgaatgcgt 
cagaggtcga 



<210> 413 
<211> 425 
<212> DNA 

<213> Escherichia coli 



<400> 413 

agtagtcaaa 

aaaagtcacc 

aacttcaact 

gctcacgcct 

acccaggcca 

tttcagaatt 

agctacagca 

acaac 



caggt gkgra 
agccatcarc 
tcagcgccag 
tctttcagag 
gttgcgccac 
acgtcgaatt 
gcagcagcgg 



acgctactgg 
ctgatttctc 
cttcttccag 
cagccggtgc 
gtact gcttt 
cagtt ttttc 
aaacacccga 



cgccttacag 
aggctgcaac 
agcttttttc 
agattctacc 
gataacagca 
ttcagcagct 
atttttcttc 



cgcaaaaagg 
ccggaagggt 
agtgcttctg 
aggtctttag 
actttgttag 
tcaaccgggc 
cattgcagag 



ct ggtgacta 
tggcttattt 
cgtcgtcttt 
cttctttcag 
cgccagcagc 
cagcagctac 
atcaagttct 



60 
120 
180 
240 
300 
360 
404 . 



60 
120 
152 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
825 



60 
120 
180 
240 
300 
360 
420 
425 
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<210> 414 " 
<211> 126 
<212> DNA 

<213> Escherichia coli 



PCT/U$00/02200 



<400> 414 

agagcttttt tcagtgcttc tgcgtcgtct ttgctcacgc cttctttcag agcagccggt 
gcagattcta ccaggtcttt agcttctttc agacccaggc cagttgcgcc acgtactqct 
ttrata 

<210> 415 
<211> 264 
<212> DNA 

<213> Escherichia coli 
<400> 415 

ctgcmacccg gargggttgg cttatttaac ttcaacttca gcgccagctt cttycagagc 
ttttttcaag tgcttctgcg tcgtctttgc tcacgccttc tttcagagca gccggtgcag 
attctaccag gtctttagct tctttcagac ccaggccagt tgcgccacgt actgctttga 
taacagcaac tttgttagcg ccagcagctt tcagaattac gtcgaattca gttttttctt 
cagcagcttc aaccgggcca gcag 

<210> 416 
<211> 201 
<212> DNA 

<213> Escherichia coli 
<400> 416 

cgcataccct gcagcatcgg cccgatggag atcaggtcgg cagaacgctg taccgctttg 
taggtggtgt taccggtgtt cagatccggg aagatgaaca cggtagcgcg acctgcaacc 
ggagagttcg gcgctttgga tttcgcaacg tcagccatta ccgcagcgtc gtactgcagc 
ggaccgtcga tcatcaggtc a 

<210> 417 
<211> 239 
<212> DNA 

<213> Escherichia coli 
<400> 417 

aattcagcag ttgacagtgg cataaacgta actggtgact -tttgcccggc atgacgccgg 
gcttttttta ttattccgtg acttccagcg tagtgaaggc aaacttctcg ccatcaaata 
gcccctgact ggttagtttt agcgcgggga tcactggcag agaaagaaac gccatctgaa 
taaacggctc atcgggtaac ggaccgcatt ■ cacgggcggc ggctttcaag gcgtcaatt 

<210> 418 
<211> 223 
<212> DNA 

<213> Escherichia coli 
<400> 418 

ttcttttttt cgtcaacggt gtccagaatc attttattta cctcgggtac ttatgctgat 
ttttattatt atggggaagg tgttatttat gagtttcatt tatgccgtaa cgacaatgaa 
ctcgggaatt agtataagca gcgcgagaat aataatcatt gtgcaaatgc taatttaatt 
aatactattt aaatattatt ttgagcatat gcacataagg ttg 

<210> 419 
<211> 223 
<212> DNA 

<213> Escherichia coli 



60 
120 
126 



60 
120 
180 
240 
264 



60 
120 
180 
201 



60 
120 
180 
239 



60 
120 
180 
223 



<400> 419 
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ttcttttttt cgtcaacggt gtccagaatc attttattta cctcgggtac ttatgctgat 60 

ttttattatt atggggaagg tgttatttat gagtttcatt tatgccgtaa cgacaatgaa 120 

ctcgggaatt agtataagca gcgcgagaat aataatcatt gtgcaaatgc taatttaatt 180 

aatactattt aaatattatt ttgagcatat gcacataagg ttg 223 

<210> 420 
<211> 212 
<212> DNA 

<213> Escherichia coli 
<400> 420 

aatagcgggt atgcacgcct ttcttttttt cgtcaacggt gtccagaatc attttattta 60 
cctcgggtac ttatgctgat ttttattatt atggggaagg tgttatttat gagtttcatt 120 
tatgccgtaa cgmcaa tgaa ■ ct cgggaatt agtataagca gcgcgagaat aataatcatt 180 
gtgcaaatgc taatttaatt aatactattt aa 212 

<210> 421 
<211> 438 
<212> DNA 

<213> Escherichia coli 



<400> 421 

ccctctaaat 

caggccat aa 

agcggggcgg 

cacagcaaca 

atctttagcc 

gtcgcgacac 

gcgcacatt c 

aagcgttcgs 



tatcgcccgt 
atgccaccag 
agagctgttt 
tgatgcctct 
acgcgaccag 
aagcaacagg 
cgggtaatcg 
gcccat kg 



ggcataaaaa 
aattatcgtc 
cagttcggcg 
gtacaaccct 
gacaccgttg 
tagtgaatat 
aaccccactt 



ctgcgt ccaa 
aaccaaccaa 
ggtaaccctt 
aacgtgccaa 
aaaaatcccg 
cctgcgttca 
gaaacatcaa 



acgccgtctt 
ttgctgaaac 
caatccattt 
gggtggcaac 
cgagcaaacc 
gtaacatccc 
tattgsgsgt 



tgccagcagc 
gccaagcagc 
gccgccagtc 
aatggcaggg 
aagcagt aaa 
caacagcacc 
aagcattwcc 



60 
120 
180 
240 
300 
360 
420 
438 



<210> 422 
<211> 682 
<212> DNA 

<213> Escherichia coli 



<400> 422 

aattcccggg 

cggt aagtat 

tacgcnaag 

gcggtatcag 

tggtgaatgc 

ttctctgtat 

catcacaaaa 

gtct gaaggt 

cattaacaag 

ggct cgttt a 

ctgacagttt 

atgcaggagt 



gatccgtcga 

ggcggggtta 

tgacaacccc 

ttttactccg 

ggctgagcgc 

ccggcgttaa. 

ttcattgttg 

tgttttgaaa 

agaaaacaag 

cgtctgatgc 

ttcttgtttc 

gatgactgca 



ccgtgcgctt 
ttccttcccc 
gctgcaacgc 
tgact gctct 
acgcggaaca 
ttgttaactg 
aggacgcgat 
t t ggtgtcac 
aacgggagct 
caaaaggatg 
cggagagcca 
gc 



ccggttgtgg 
gttgaggaca 
cctctgttat 
gccgcccttt 
gttaaaacca 
gttaacgtca 
aatgaaaacg 
tat cagtaac 
at t aaataaa 
t gcacaatga 
gttgatattg 



caacccgcga 
ccgggttgtc 
caattttctg 
ttaaagtgaa 
aaaacagtgt 
cctggaggca 
ttattaccaa 
ccagtattta 
atatgcattg 
attcagcatt 
cagtcagtgt 



aatggcgcgg 
aggttgacca 
gtgacgtttg 
t tttgtgatg 
tatgggtgga 
ccaggcactg 
acgttaatac 
ctgaagatgc 
t ttcaatgct 
tgtgcttgtt 
t cacaggaca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
682 



<210> 423 
<211> 600 
<212> DNA 

<213> Escherichia coli 
<400> 423 

ggggatccga ttgtgactgc tctgccgccc tttttaaagt gaattttgtg atgtggtgaa 60 
tgcgcctgag cgcacgcgga acagttaaaa ccaaaaacag tgttatgggt ggattctctg 120 
tatccggcgt taattgttaa ctggttaacg tcacctggag gcaccaggca ctgcatcaca 180 
aaattcactg ttgaggacgc gataatgaaa acgttattac caaacgttaa tacgtctgaa 240 
ggttg-tttg aaattggtgt cactatcagt aacccagtat ttactgaaga tgccattaac 300 
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aagagaaaac aagaacggga gctattaaat aaaatatgca ttgtttcaat gctggctcgt • 360 

ttacgtctga tgccaaaagg atgtgcacaa tgaattcagc atttgtgctt gttctgacag 420 

tttttcttgt ttccggagag ccagttgata ttgcagtcag tgttcacagg acaatgcagg 480 

agtgtatgac tgcagcaacc gaacagaaaa ttcccggtaa ctgttacccg gtcgataaag 540 

ttattcacca ggataatatc gaaatcccgg caggtcttta aacagttccg taataaataa 600 



<210> 424 

<211> 100 

" <212> DNA 

<213> Escherichia coli 

<400> 424 

gggatccagc aagaagatgc ggttgtaccg tcatcacgca gatgcgcaaa gctactcagc 60 
aactgacctt tcttcgcaat aagcacgcca ttagcgtcat " ' 100 

<210> 425 
<211> 465 
<212> DNA 

<213> Escherichia coli 
<400> 425 

tcgcgtgttf accttcaaca tcggtaactt tctggcggat agtttcacgg taagcaacct 60 
gcggtttacc tacgttcgct tcaacgttga attcacgctt catacggtca acgatgatgt 120 
cgaggtgcag ttcgcccata cccgcgatga tggtctggtt agattcttcg tcagtccata 180 
cacggaaaga cgggtcttct ttagccagac ggcccagagc cagacccatt ttttcctggt 240 
cagctttggt tttcggttca actgcgatgg agattaccgg ctcagggaat tccatacgtt 300 
ccagaatgat cggcgcatcc gggtcacaca gggtgtcacc agtggttacg tctttcagac 360 
cgatagcagc agcgatgtcg cccgcgcgaa cttctttgat ctcttcacgt ttgttagcgt 420 
gcatctgaac gatacgaccg aaacgctcac gtgcagcttt cacgg " 4 65 

<210> 426 
<211> 653 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc__f eature 
<222> (1) . . . (653) 
<223> n = A, T, C or G 

<400> 426 

tgatcggctc aagcagaact ggtttcgctt tcttaaagcc ttctttaaag gcgatagaag 60 

cagccagttt aaacgccagt tcagaggagt caacgtcatg gtaagaaccg aagtgcagac 120 

gaatacccat gtctactacc gggtagcctg ccagcggacc tgctttcagc tgttcctgga 180 

tacctttatc aacggccggg a.tgtattcgc cagggattac * accaccttta atgtcgttga 240 

tgaactcgta gcctttcggg tttgaacccg gctccagcgg gtacatgtcg ataacaacat 300 

gaccatactg accacgacca ccagactgtt tcgcgtgttt accttcaaca tcggtaactt 360 

tctggcggat agtttcacgg taagcaacct gcggtttacc tacgttcgct tcaacgttga 420 

attcacgctt catacggtca acgatgatgt cgaggtgcag ttcgccatac ccgcgatgat 480 

ggctgggtag attcttcgtc agtccataca cggnaagacg ggtcttnttt agccagacgg 540 

gccagagnca gacccatttt tttctggcag ctttggnttc ggtcaactgc gatggaaata 600 

cccggctcaa ggaatrcata cgtttcanaa tgatcggggc attccgggtc aca " 653 

<210> 427 
<211> 268 
<212> DNA 

<213> Escherichia coli 
<400> 427 

ctttcttaaa gccttcttta aaggcgatag aagcagccag tttaaacgcc agttcagagg 60 
agtcaacgtc atggtaagaa ccgaagtgca gacgaatacc catgtctact accgggtagc 120 
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ctgccagcgg acctgcTTtc agctgttcct ggataccttt atcaacggcc gggatgtatt 
cgccagggat tacaccacct ttaatgtcgt tgatgaactc gtagcctttc gggtttgaac 
ccggctccag cgggtacatg tcgataac 

<210> 428 
<211> 330 • 
<212> DNA 

<213> Escherichia coli 



180 
240 
268 



<400> 428 

gttttgggga gatgtaaggg ctaatctgaa tggctgcatt ccttgtttaa 
atgactgatt gccgatacct gattaaacgg gtcatcaaaa tcatcattgc 
ctgatccttc tgttcttata ' acacaaggaa acgtacttaa ggtgcgtccg 
cggac'gcacc tttaataact ataaataagt gtctgggcag atactatata 
gtgaatgatt atgctaatgt catcaattaa ataaatataa tggcgttaag 
aatataatta atactctact tccagagtag 

<210> 429 
<211> 465 
<212> DNA 

<213> Escherichia coli 



ggaaaaacga 
tgttttacag 
gtgaaccagt 
aattaactta 
.gcttcccagt 



60 
120 
180 
240 
300 
330 



<400> 429 

gttttgggga 

atgactgatt 

ctgatccttc 

cggacgcacc 

gtgaatgatt 

aatataatta 

gcacaaattt 

tattggggat 



gatgtaaggg 
gccgatacct 
tgttcttata 
tttaataact 
atgctaatgt 
atactctact 
atcccacaac 
cgggcacacc 



ctaatctgaa 
gattaaacgg 
acacaaggaa 
ataaataagt 
catcaattaa 
tccagagtag 
tgttcttctg 
ttctggcagg 



<210> 430 
<211> 379 
<212> DNA 

<213> Escherichia coli 



tggctgcatt 
gtcatcaaaa 
acgtacttaa 
gtctgggcag 
ataaatataa 
aatattaaat 
tctcgacatg 
ttggtgtctc 



ccttgtttaa 
tcatcattgc 
ggtgcgtccg 
atactatata 
tggcgttaag 
tttatccgcg 
cgccggatct 
gtagt 



ggaaaaacga 
tgttttacag 
gtgaaccagt 
aattaactta 
gcttcccagt 
tggtgcatca' 
ttcacaatag 



60 
120 
180 
"240 
300 
360 
420 
4 65 



<400> 430 

aatctgaatg 

ttaaacgggt 

acaaggaaac 

aaataagtgt 

tcaattaaat 

cagagtagaa 

ttcttctgtc 



gctgcattcc 
catcaaaatc 
gtacttaagg 
ctgggcagat 
aaatataatg 
tattaaattt 
tcgacatgc 



ttgtttaagg 
atcattgctg 
tgcgtccggt 
actatataaa 
gcgttaaggc 
tatccgcgtg 



aaaaacgaat 
ttttacagct 
gaaccagtcg 
ttaacttagt 
ttcccagtaa 
gtgcatcagc 



gactgattgc 
gatccttctg 
gacgcacctt 
gaatgattat 
tataattaat 
acaaatttat 



cgatacctga 
tt cttataac 
taataactat 
gctaatgtca 
actctacttc 
cccacaactg 



60 
120 
.180 
240 
300 
360 
379 



<210> 431 
<211> 443 
<212> DNA 

<213> Escherichia coli 



<400> 431 

aagatgatgt 

aaaaaacact 

attagtttat 

tgggtat caa 

aatcccattg 

aaaccaatat 

ccctgcat ac 

agataaatgt 



gatgagaaag 
aaatcaaaaa 
ttcaaatgag 
ataaacaaca 
cctggcggat 
cacgcagcag 
cagccagcat 
actgcccgat 



t caa t t tgaa 
ataa tggcat 
gaaaatctcc 
gaggagaaat 
aagttgcggc 
ttttttcgcc 
caacgccgca 
gtc 



taagacaat a 
tagaaaatat 
cggcgaaaaa 
ttttaacgca 
ttaacgccag 
ggattggt ac 
ctgtgcttgc 



ttaagagcta 
aatgcgaaaa 
accgggagat 
gccattcagg 
gaa t gcgtgtc 
cggaaaacag 
ggctacgctc 



aaaaaatgtc 
cggaggtgaa 
gaaagtgtga 
caaatcgttt 
ggccagtttc 
atcgcggaat 
atagcgacgc 



60 
120 
180 
240 
300 ' 
3 60 
420 
443 
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<210> 432 
<211> 638 
<212> DNA 

<213> Escherichia coli 
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<400> 432 

cagggggttt 

tacaataaac 

agtcactcta 

cgcgcaattt 

taaattaaaa 

taaatgcttg 

tccaaaaaga 

agcaataaaa 

gttagtaata 

atggaaataa 

ggtgcttggt 



gttgtgggca 
aacgaatcag 
agaggaggag 
aaacttagtg 
aaacgactgt 
cttaaatctc 
gatactacaa 
aataataaca 
gtcgttagta 
ttttctttat 
ttgctaacca 



atgatgcatt 
ggcatttgat 
aaattaggtt 
ctttacat eg 
tatgtataag 
tatatccttc 
ataaagatgc 
atgatataaa 
tgtttgattc 
actgagacat 
ataaaaagag 



taagttatcg 
agtcaatacc 
ggrattatag 
ctattgtctt 
caaaggtccg 
cccgaaaaat 
ctttatttta 
tctaatgttt 
tccatatatt 
cacaccatca 
tgeatteg 



tetgeagata 
gcaattctat 
cttgtgcgcg 
gatttctttg 
aacgaaaaat 
gacacat aaa 
ttatttctaa 
ttaaatatat 
acgtgtagtt 
tcaaatggaa 



gaggagatat 
caggagatat 
ccatgattgg 
aattatttta 
acattccaaa 
attgagatat 
taaaaataga 
tgtcttttat 
ttttatatac 
gtttgaagat 



<210> 433 
<211> 299 
<212> DNA 

<213> Escherichia coli 
<400> 433 

ctttacctgg catgatccac ttcgccagaa taceggcaat aagcccaaaa ataatccatg 
acagaatgee cattgtttcc tcactta.tct gttttgeatt agegggttag tegctgataa 
aaagcatagc acaacategg gagggcaaga tttgtgacga geatcaegga ggtttttttg 
egatggegea gaaattgege catcaacgat cagtgataat taccaaccac aaacatcatg 
ttcgttttcc gtgtcataag aacgtacggt attcaccaga tcttttatca cttcagccg 

<210> 434 
<211> 388 
<212> DNA 

<213> Escherichia coli 



<400> 434 

gaaaaaggag- 

ttattgtgct 

atcgacctga 

cgcggcatcg 

gcgcacgcta 

atgaatcgcg 

cacgcccagc 



geaatategg 
gcaggtgttt 
tcatcaaact 
gctttcat ca 
tataccggcc 
ttatcaat eg 
gacacagaac 



gtaaaggcat 
tagcgggttg 
gaatagegge 
tccgcaccat 
ccagtttacg 
ctgccttacg 
gaattccc 



tagcccgacg 
ttgatccaca 
ctgctcgtaa 
tgggctgggc 
atgaaagccg 
cgctttgtct 



aataegtegg 
ggttctaact 
gtttcctggg 
tgatagttgg 
ttcgccagtt 
ttataggcat 



gctacaaata 
ggaagaccac 
cggacaccgg 
aaacatggta 
cctgcgcctg 
ccggctgcgc 



<210> 435 
<211> 351 
<212> DNA 

<213> Escherichia coli 



<400> 435 

ctatccttga 

acagaaaaag 

atattattgt 

cacatcgacc 

cggcgcggca 

gtagcgcacg 



tgaaaccgcg 
gaggcaatat 
getgeaggtg 
tgatcatcaa 
teggctttea 
ctatataccg 



agcaaagata 
egggtaaagg 
ttttagcggg 
actgaatagc 
t catccgcac 
gccccagtt t 



ggtgattacg 
cattagcccg. 
ttgttgatcc 
ggcctgctcg 
cat tgggctg 
acgatgaaag 



tcatggtttt 
aegaataegt 
acaggttcta 
taagtttcct 
ggctgatagt 
ccgttcgcca 



acagaaaat t 
egggctacaa 
actggaagac 
gggeggacac 
tggaaacatg 

g 



<210> 436 
<211> 762 
<212> DNA 

<213> Escherichia coli 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
638 



60 
120 
180 
240 
299 



60 
120 
180 
240 
300 
360 
388 



60 
120 
180 
240 
300 
351 
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<220> 

<221> misc^feature 
<222> (1) . . . (762) 
<223> n =■ A, T, C or G 
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<400> 43*6 
aattatgaaa 
•gtaataacac 
ggaggtaagc 
gccataaata 
agcttccccg 
aaagtatgcc 
gcaaatattc 
ttagcgatag 
aaacgggaaa 
acgtcatcat 
gggaggggga 
tactgcttan 
ttggatattt 



cactgtctgg 
aggaaactat 
cgacgatttc 
aggaaagggt 
cttactaaga 
ttcacgataa 
tttgcgagtc 
actgcattca 
gcctctcccg 
actggtgata 
aacctcttcc 
cgctaccagt 
taatgaaagc 



aatcgtctga 
tttatctacg 
agcgggacgc 
catgatgaag 
ct accagggc 
gattgtcaat 
gttacgcaat 
gggcgaaagg 
gagaagaggg 
ctcttagtga 
taaccct cac 
ttgtctgccc 
cgacaaatca 



atgacgggca 

cgttagcgat 

tgaaacggga 

ctactcatca 

gggggaaacc. 

ccgcaggctt 

aatcacagag 

aggtaagccg 

cttttaataa 

ttagcttcca 

ttctgaaatt 

tggcggttgt 

atcancgtga 



ca t ttgcgag 
agactgctt g 
aagcccctcc 
tcgtggtgct 
ccgctctacc 
tgtagtctgc 
gaaactattt 
atgatttcag 
ggaaagggtt 
ggcttactaa 
gggtgctatg 
aacgccagat 
eg 



cacgcatcca 
catggcgaaa 
cgaggaaggg 
cttagt cata 
ctcact cctg 
gatcctgcca 
tattcacgeg 
egggaegctg 
atgatgaagc 
gaacaccagg 
acgctggcgt 
cggtacccgt 



<210> 437 
<211> 292 
<212> DNA 

<213> Escherichia coli 
<400> 437 

cacatttgeg agcacgcatc cagtaataac acaggaaact attttatcta cgcgttagcg 
atagactget tgcatggcga aaggaggtaa gecgacgatt teagegggae getgaaaegg 
gaaagcccct cccgaggaag gggccataaa taaggaaagg gtcatgatga agctactcat 
catcgtggtg ctcttagtca taagcttccc cgcttactaa gactaccagg gcgggggaaa 
ccccgctcta ccctcactcc tgaaagtatg ccttcacgat aagattgtca at 

<210> 438 
<211> 631 
<212> DNA 

<213> Escherichia coli 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
7 62 



60 
120 
180 
240 
292 



<400> 438 

atttacactt 

tatggcagga 

ttaaaaataa 

cagccaccgc 

ttgetgaett 

agtacaacat 

ggacctatat 

cctacgacgg 

tgcagacacc 

tatttcacgt 

tagaegtagt 



tttacgaaat 
aagatatgcg 
gatgttgctg 
gggt tcgacc 
taagcatttc 
taagcagtgg 
gggtggcgcg 
tgagattttt 
ttgttatttt 
tgattaatgc 
aactggctgt 



catgggatca 

actgatatta 

ggtgcgcttt . 

aatacct egg 

aaaccagggg 

cagttgcgta 

tacgtgttga 

tatcatcget 

ttattattag 

ggttgcctcc 

tateggaatt 



<210> 439 
•<211> 566 
<212> DNA 

<213> Escherichia coli 



ctaacaaaat 
cagatcccca 
tgctggttac 
gaatttctaa 
acaccgt acc 
acctgcccgc 
tcagcgacac 
aaaaaaagee 
ccacttgctc 
agtgcgccag 
9 



atcgcttgtc 
aagtggagag 
cagtgccgcc 
gtatgagtta 
agaaatgtac 
gcctgatgcc 
egaeggtaaa 
ccctcatcat 
gtcttgcttg 
atttaacttt 



agttatattg 
tttatgacca 
tgggccgcac 
agtagtttca 
cgtaccgatg 
gggaegcact 
atcattaaag 
gagggggaaa 
ttattagtcg 
gtttgtatcg 



60 
120 
180 
.240 
300 
360 
420 
480 
540 
600 
631 



<400> 439 

tatggcagga aagatatgcg actgatatta cagatcccca aagtggagag tttatgacca 60 

ttaaaaataa gatgttgctg ggtgcgcttt tgctggttac cagtgccgcc tgggccgcac 120 

cagccaccgc gggttcgacc aatacctegg gaatttctaa gtatgagtta agtagtttca 180 

ttgetgaett taagcatttc aaaccagggg acaccgtacc agaaatgtac cgtaccgatg 240 
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agtacaacat 
ggacctatat 
cctacgacgg 
tgcagacacc 
tatttcacgt 
tagacgtagt 

<210> 440 
<211> 339 
<212> DNA 
<213> Escherichia coli 



taagcagtgg cagttgcgta acctgcccgc gcctgatgcc gggacgcact 300 

gggtggcgcg tacgtgttga tcagcgacac cgacggtaaa atcattaaag 360 

tgagattttt tatcatcgct aaaaaaagcc ccctcatcat gagggggaaa 420 

ttgttatttt ttattattag ccacttgctc gtcttgcttg ttattagtcg 480 

tgattaatgc ggttgcctcc agtgcgccag atttaacttt gtttgtatcq 540 

aactggctgt atcgaa ' " 566 



<400> 440 

cgtattcaca 

atcttcatta 

ggagcataaa 

tcggcactga 

tgcagccttt 

aatcgcacca 



tccttttgat 
cagcggtcgt 
gatgaaaaaa 
gctgggatgg 
gtgcaattta 
ataacggcaa 



tggtgataac atgcgaatcg .gtattatttt tccggttgta 
atttttagca tggtttttta ttggcggcta .tgctgccccg 
acaacgatta ttatgatggg tgtggcgatt attgtcgtac 
tggtaacgtc acctctaaaa aatagcaaag gctgcctgtg 
agcgttaact tttaatcttc ctgtagataa atagcacgac 
ccacgaagct gccaaaatt 



<210> 441 
<211> 376 
<212> DNA 

<213> Escherichia coli 



60 
120 
180 
240 
300 
339 



<400> 441 

catgaatatt 

cataaaatat 

acaaagaagc 

cttttaattt 

catacataaa 

gtccatctct 

ttaaaacaac 



t aaaaaggaa 
ttccgtggag 
attgaatgca 
aacatgtaaa 
tggagtcatg 
aattgcata't 
tgtaca 



aacgacatga 
cattttatta 
gggaaaaata 
cgcatggtta 
ttttcccttt 
tttaattttt 



aaccgaagca 
ttgaatatag 
atatggccat 
at cctcatat 
tccatttatc 
ctgataaatg 



cagaatcaac 
aggtttaact 
aaaaaacatc 
cacgggtgga 
aagttcctgt 
gcattgagca 



attctccaat 
ccggtaaaaa 
gaaagaaact 
gtgttaagaa 
tgccgtttta 
tcgatttcat 



<210> 442 
<211> 446 
<212> DNA 

<213> Escherichia coli 



<400> 442 . 
ttacgatagc 
tgactacctt 
tat ttaaaaa. 
atatttccgt 
aagcattgaa 
atttaacatg 
taaatggagt 
ctctaattgc 



tattagtaaa 
cgtttttttg 
ggaaaacgac 
ggagcatttt 
tgcagggaaa 
taaacgcatg 
catgttttcc 
atattttaat 



aatataagag 
attaagaatg 
atgaaaccga 
attattgaat 
aataatatgg 
gttaatcctc 
cttttccatt 
ttttct 



ttagctgtat 
attttattat 
agcacagaat 
atagaggttt 
ccataaaaaa 
a tatcacggg 
tatcaagtt c 



tgttatgtct 
cgtaagtaaa 
caacattctc 
aactccggta 
catcgaaaga 
tggagtgtta 
ctgttgccgt 



gtggcgaaat 
attacatgaa 
caatcat aaa 
aaaaacaaag 
aactctttta 
agaacat aca 
tttagtccat 



<210> 443 
<211> 388 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc_feature 
<222> (1) . . . (388) 
<223> n = A, T , C or G 

<400> 443 

tcaccccggt gccgattttc aggcatcctg atttaactta gcacccgcaa cttaactaca 



60 
120 
180 
240 
300 
360 
376 



60 
120 
180 
240 
300 
360 
420 
446 



60 
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ggaaaacaaa gagataaatg tctaatcctg atgcaaatcg agccgatttt ttaatcttta 

cggactttta cccgccrggt ttattaattg cactgtnatc cgggcgttcg cccgctttaa 

tcaeaatagg ctgtgtagcc tgggcctgtt tctctttcac ccgcgccaga gcggcagcaa 

tcgcatcttt atctttggct gcaggttgaa cggctgcgct cttatgtcgt tcaaggcgag 

ccgctttttc gcgctccaga cgagcctggc gcgcttcgaa acgcgctttg gcttctgcgg 
cncgcttttc ttcctgacga atagccgc 

<210> 444 
<211>. 209 
<212> DNA 

<213> Escherichia coli 
<400> 444 

aattttaata acgctatctg cggataaagc agaataggtg gttaacccca gacataaacc 
gaggaaaata atgttattgt atttcataat ctattgttcc ttagcgacag attgctgtct 
gctggttcag taaggtacca ggagaaactt caggaagctt gtactcgaca atacagtttg 
agtttttatc tttgccccat gaaacctgt 

<210> 445 
<211> 341 
<212> DNA 

<213> Escherichia coli 



120 
180 
240 
300 
360 
388 



60 
120 
180 
209 



<400> 445 

catcctcaat 

acgtaatctg. 

tgtttttttg 

cgaggacgcg 

accccagaca 

cgacagattg 



accgttaaat 
aaaagggacg 
gagatctacg 
cccgct aatt 
taaaccgagg 
ctgtctgctg 



gcaacccgaa 
gctggacttg 
gtaaaattaa 
ttaataacgc 
aaaataatgt 
gttcagtaag 



cccccgttgt ccctttgctg 
tgctaccggt cgttggaaat 
gcgaatccga tgagactgtg 
tatctgcgga taaagcagaa 
tattgtattt cataatctat 
gtaccaggag a 



cattcactta 
tgtctggcac 
cagccataat 
taggtggtta 
tgttccttag 



60 
120 
180 
240 
300 
341 



<210> 446 

<211> 697 

<212> DNA 

<213> Escherichia coli 



<400> 446 

agatttactg 

acgaatcmcg 

cgggcgctga 

caggatcgtt 

tctgaatacc 

cggcaatcgc 

tttatgcgtc 

aacacgtaat 

atccagagat 

cgtgktcccc 

caatgctttt 

gataacatcc 



ccaatttccg 
ggctataccg 
ttcttcccac 
gagcagcaga 
cgccagaaaa 
aatcattttt 
ataacttcac 
tcattgcccc 
gggaatactg 
agagccacca 
tgagttatca 
gttgccagat 



gcagatcgga 
ccaggcatgg 
gcggttattt 
tgat ct acca 
ggtgccacag 
tccccttagt 
gtatgtagca 
caacattgaa 
cttttaatga 
actccgtttt 
ggtgcattct 
tggcacggat 



aagggttaam 
cttgagccat 
tggcacacac 
gttccagcgc 
cagttagctt 
acgatgaaca 
cttf tgcgat 
aacataatgc 
ctcaggtttt 
atgttgcggg 
tcatcacgtc 
gaattat 



ccatattgat 
ggcattaaat 
caga tccagc 
ctgggtgtat 
ttctcctgct 
gcggtaaaga 
tcaaaaaaga 
ttatccagat 
ttgaaatatc 
tatttttccg 
cgt kgmcaaa 



ccataagggt 
tccgcaaatt 
aaggggtttt 
tgttcct cgt 
tgcaaga tgt 
aatcgtattc 
ccattgctac 
atttgaagtt 
ccttagcaat 
cagcatcttt 
ttggcaatat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
697 



<210> 447 

<211> 215 

<212> DNA 

<213> Escherichia coli 



<400> 447 

aattaataac ttttcgttag gcagttttgg gtgtgagttg caagagggga gactactgaa 60 

taactcaagt tttataatcg aggggaaaat ggtgatggcg ttcatagcaa aacgccctca 120 

accataaagg tcgagggcgc ttaagatgtt aaaaacccgc tatccgttaa aaaacaatgt 180 

tcaacraagg tcagtgacat tgcgctaaaa aagcg 215 
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<212> DNA 

<213> Escherichia coli 
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<400> 448 

gcattattca 

ttaaggttta 

gcattgctat 

taagggacag 

attgaatgtt 

aa'attcatat 

atattgaaat 



tgagaaatgt 
attatctgtg 
aatattggtt 
gcatagagta 
gacgctatgt 
aatgcgcctt 
agtccaacta 



gtatcgtaaa 
tgtgatattt 
atcatttgct 
atgatacgta 
gtttatgagg 
tgctcatgaa 
cttctttatt 



tcaactgaaa 
tattgaatgt 
gaatggattc 
tgcataacca 
gagaggtatt 
tggatgccag 
accaa 



ttaacgcaac 
tttaaatat t 
agtcttaatg 
acatctttac 
ttcagttgat 
tatgtagtgg 



catttgttat 
gtttttattg 
agtgggtttt 
tcattatgtc 
ctggattgtt 
gaaattataa 



<210> 449 
<211> 641 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc__f eature 
<222> (1) . . . (641) 
<223> n « A, T, C or G 



<400> 449 

ataatcaggt 

cgcgtggcaa 

atgatatctt 

ctatacttat 

attgattatt 

aaattaatta 

gcgctcctgt 

gnntgcncgn 

cgatngtcgg 

cgaaaggctn 

tcctgagnan 



aagaaaaggt 
tacatcagtg 
ccgatttatc 
tcagcactca 
ttcctgattg 
atgtcatcag 
catgtgcatt 
cgncccacct 
cgggngcctc 
attncaaaga 
gacaaatncc 



gcgcggagat 
gcaataaaac 
ttaatcgttt 
caaataaagg 
ggctactggt 
gtccgaaaat 
gcttcatata 
naccccatgc 
cccatgcnan 
ctgggccttn 
gccgggagcg 



taccgtgtgt 
gacatatcca 
atggataacg 
aacgccaatg 
ggtgactggc 
aacgagaat a 
atcactggcg 
cgaacttcag 
agtangggaa 
cntttatctg 
gatttgaacn 



tgcgatatat 
gaaaaatata 
gcaaagggct 
aaaattatac 
gtatttaaga 
tttcagtctc 
caaggagcgc 
aantgaaaac 
nt gccangcg 
atgtttgtcg 
t 



tttttagttt 
cactaagtga 
tcgttttttc 
tctgggctgt 
tgatatttta 
tcatcctgtt 
cgcaggcgna 
nccntaacnc 
ncnnattaaa 
gagaacgctc 



<210> 450 
<211> 314 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc_feature 
<222> (1) . . . (314 ) 
<223> n = A, T,C or G 

<400> 450 

gaactacgag taagaatagc tncgaattcc cgtttatgga 
ttttcctata cttattcagc actcacaaat aaaggaacgc 
gctgtattga ttattttcct gattgggcta ctggtggtga 
ttttaaaatt aattaatgtc atcaggtccg aaaataacga 
-ctgttgcgct cctgtcatgt gcattgcttc atataatcac 
gggntntnnt cttt 

<210> 451 
<211> 236 
<212> DNA 

<213> Escherichia coli 



taacggcaaa 
caatgaaaat 
ctggcgtatt 
gaatatttca 
tggcgcaagg 



gggcttcgtt 
tatactctgg 
taagatgata 
gtctctcatc 
agcgcgcagg 



60 
120 
180 
240 
300 
360 
395 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
641 



60 
120 
180 
240 
300 
314 



<400> 451 
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atatacacta agtgaaTgat atcttccgat ttatcttaat cgtttatgga taacggcaaa 

gggcttcgtt ttttcctata cttattcagc actcacaaat aaaggaacgc caatgaaaat 

tatactctgg gctgtattga ttattttcct gattgggcta ctggtggtga ctggcgtatt 

taagatgata ttttaaaatt aattaatgtc atcaggtccg aaaataacga gaatat 

<210> 452 
<211> 418 
<212> DNA 

<213> Escherichia coli 



60 
120 
180 
236 



<400> 452 

cggagattac 

ataaaacgac 

atcgtttatg 

ataaaggaac 

tactggtggt 

cgaaaataac 

tcatataatc 



cgtgtgttgc 
atatccagaa 
gataacggca 
gccaatgaaa 
gactggcgta 
gagaatattt 
actggcgcaa 



gatatatttt 
aaatat acac 
aagggcttcg 
attatactct 
t ttaagatga 
cagtctctca 
ggagcgcgca 



<210> 453 
<211> 551 
<212> DNA 

<213> Escherichia coli 



ttagtttcgc 
taagtgaatg 
ttttttccta 
gggctgtatt 
tattttaaaa 
tcctgttgcg 
gggggcggcc 



gtggcaatac 
atatcttccg 
tacttattca 
gattattttc 
ttaattaatg 
ctcctgtcat 
aatcgccgcc 



atcagtggca 
atttatctta 
gcactcacaa 
ctgattgggc 
t cat caggtc 
gtgcattgct 
gccccctg 



60 
120 
180 
240 
300 
360 
418 



<400> 453 

aacaatttgc 

cgaccgccat 

cgctgcctaa 

cgctggcgga 

tgttacagga 

taactaactc 

cgtgtttgcc 

tccttttctt 

tccccrgttt 

gtaactagca 



ccatgcgctc 
tcgactgtta 
acggct ggaa 
tgtcgttcgc 
ttagtggctt 
tctgtctaaa 
ctgcattgcg 
gtggtgcgcg 
gtaatacgta 



ggtcatgcgc 
atgggcgaat 
gaagcgggtt 
tgatgtggtt 
tgcgcgataa 
cctctatcca 
ccttcttcac 
ttccgctatc 
attaaacagg 



tgcatcgccc 
cttcagtact 
ttgcgtttcg 
tacagcaaac 
gatcgtctgg 
gcatctcctg 
gtaatctgtc 
tcaccaataa 
gcttttagct 



ggccattttg 
ggtattaggt 
ctggtacgat 
atccgccagt 
tgaaagtcgg 
agcaatacgc 
agcaatggtc 
atgcacgaaa 
gtctgtcatt 



sgcgtccccg 
ggacaacgcg 
ttagaagagg 
taact cccgg 
gtcaccatca 
agggcttctt 
atcaagtttc 
acgctgggca 
agtgktccct 



<210> 454 
<211> 93 
• <212> DNA 
<213> Escherichia coli 

<4O0> 454 

tggcatctcg gtgttgccga tcttcatgat atccagcccg ccggaaactt cttcccaaac 
ggttttgctg -ttatccattg agtcacggaa ctg 

<210> 455 
<211> 232 
<212> DNA 

<213> Escherichia coli 



60 
120 
180 
240 
300 
360 
420 
480 
540 
551 



60 
93 



<400> 455 

cgtgccgaga tgatcctgta accatcatca gttgtgaagt agtgattcac gacttcaagg 

cgcttttcaa aagggtattt tggctttgac atattagggg ctattccatt tcatcgtcca 

acaaaatggg tgcagtacat actcgttgga aatcaacaca ggaggctggg aatgccgcag 

aaatatagat tactttcttt aatagtgatt tgtttcacgc ttttattttt ca 

<210> 456 

<211> 713 • 
<212> DNA 

<213> Escherichia coli 



60 
120 
180 
232 
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<400> 456 

ttagnggatn 

ccagcagagc 

cacgaacgcg 

ccatgctgta 

tacgaggtac 

-gagttacttt 

gtgctttggt 

ccagcagttt 

cccgagggac 

cgtctttgta 

ctgagtagga 

tggnggcagg 



naangcccac 
gcggcctt ct 
gtcgacttta 
gttaccacga 
agcaatagtg 
acagccgatc 
gatcagcggt 
tttgtcagcg 
ttgcatgaca 
gtaatcatgc 
caaatccgcc 
accccgccat 



ancctcgang 
tcgtcagatt 
tcgtagtcga 
ccgtcgaaag 
atcaggcgct 
ggatagccct 
ttttgaccgg 
atcgcttcac 
gaattgtagt 
agtttcgcca 
ggagccggat 
aactggcagc 



gatctaggag 
tcgcagtagt 
tttctgggaa 
a'cttagcgga 
caaagaactc 
gacggatttt 
agattgctgc 
caacacccat 
taaactcagt 
tcgtactact 
ttaacgttgc 
attaaattaa 



gtagaatagc 
ggtaatggta 
gatgatctgc 
caggccacgg 
ccacatgcgt 
gaagcctgca 
caggtctgct 
gttcagggtg 
catgagtttt 
ccatgtcggt 
gaacaaccgn 
gcagaaggcc 



ttcgaattcc 
atatccaaac 
tcacggacac 
aagtcacgga 
tcgccacgca 
acagatttgc 
gctgcgttat 
atcttctcga 
ttaactactt 
gaacgctctc 
cccggagggg 
ate 



<210> 457 
<211> 292 
<212> DNA 

<213> Escherichia coli 
<400> 457 

tgaacagcag. agataeggee agtgcggcca atgttttttg tcctttaaac ataacagagt 
cctttaagga tatagaatag gggtatagct aegecagaat ategtatttg attattgeta 
gtttttagtt ttgcttaaaa atattgttag ttttattaaa tgcaaaacta aattattggt 
atcatgaatt tgttgtatga tgaataaaat ataggggggt atagatagac gtcattttca 
tagggttata aatgegacta ccatgaagtt tttaattgaa agtattgggt tg 

<210> 458 
<211> 282 
<212> DNA 

<213> Escherichia coli 
<400> 458 

ttattaaatg caaaactaaa ttattggtat catgaatttg ttgtatgatg aataaaatat 
aggggggtat agatagacgt cattttcata gggttataaa tgcgactacc atgaagtttt 
taattgaaag tattgggttg ctgataattt gagctgttct attcttttta aatatctata 
taggtctgtt aatggatttt atttttacaa ttttttgtgt ttaggcatat aaaaatcaac 
ccgccatatg aacggcgggt taaaatattt acaacttagc aa 

<210> 459 
<211> 300 
<212> DNA 

<213> Escherichia coli 
<400> 459 

tctgcgttcc gctaaaaggt geaaatgetc aggacgttgc agcgttttgc gtgaccgctc 
ggggaaggca aaattgeetc tgggaaagca ttgcgcgggg tccggcgctc atcaacaatc 
ggggggcagc aaggggctga aacgggaaag cccctcccga agaaggggee ttgtataagg 
aaagggttat gatgaagctc gtcatcatac tggttgtgtt gttactgtta agtttcccga 
cttactaaca actcatcaga ggggggagaa . at cctccctt acccttgttc ctttactcta 

<210> 460 
<211>. 293 
<212> DNA 

<213>' Escherichia coli 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
713 



60 
120 
180 
240 
292 



60 
120 
180 
240 
282 



60 
120 
180 
240 
300 



<400> 460 
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cggqgtccgg cgctcatcaa caatcggggg gcagcaaggg gctgaaacgg gaaagcccct 

cccgaagaag gggccttgta taaggaaagg gttatgatga agctcgtcat catactggtt 

gtgttgt'tac tgttaagttt cccgacttac taacaactca tcagaggggg gagaaatcct 

cccttaccct tgttccttta ctctaggttg aaaaaacaac agcgtcaa'ta ggcctgccat 

gtacgaagcg agatctgtga accgctttcc ggttagcctt ttttatcctg ttg 

<210> 461 
<211> 359 
<212> DNA 

<213> Escherichia coli 



60 
120 
180 
240 
293 



<400> 461 

caacacagga 

ttcacgcttt 

caggagagtt 

gcgggaaatt 

tcttcacaaa 

ataaaaaaat 



ggctgggaat 
tatttttcac 
atgagctggc 
attcccgcct 
aataataata 
ccccgccaaa 



gccgcagaaa 
ctggatgata 
ggcgttttta 
tacttacggc 
gattttatta 
tggcagggat 



tatagattac 
agagat tcac 
gcctgcaaat 
gttgcgcatt 
cgcgatcgat 
cttagattct 



tttctttaat 
tgtgtgaatt 
tgaaagagta 
ctcattgcac 
tatttatttc 
gtgcttttaa 



agtgatttgt 
gcatat taaa 
agagtcttcg 
ccaaatttat 
ctgaaaacaa 
gcagagatt 



60 
120 
180 
240 
300 
359 



<210> 462 
<211> 673 
<212> DNA 

<213> Escherichia coli 



<400> 462 

gcaacccatg 

ccatcatcag 

ggctttgaca 

ctcgttggaa 

atagtgattt 

ttgcatatta 

taagagtctt 

acccaaattt 

tcctgaaaac 

aagcagagaa 

ggtgaaggac 

gaagtgtacg 



tcctgacctg 
ttgtgaagta 
tattaggggc 
atcaacacag 
gtttcacgct 
aacaggagag 
cggcgggaaa 
attcttcaca 
aaataaaaaa 
tacaggctgg 
actttctgac 
aac 



ggttcggggg 
gtgattcacg 
tattccattt 
gaggctggga 
tttatttttc 
t tatgagctg 
ttattcccgc 
aaaataataa 
atccccgcca 
ttacgttacc 
cttcgtccag 



acaccaaaac 
acttcaaggc 
catcgtccaa 
atgccgcaga 
acctggatga 
gcggcgtttt 
cttacttacg 
tagattttat 
aatggcaggg 
agctgccggg 
agatttgtaa 



gtgccgagat 
gcttttcaaa 
caaaatgggt 
aatatagatt 
taagagattc 
tagcctgcaa 
gcgttgcgca 
tacgcgatcg 
atcttagatt 
cctttagcgc 
ccatcgttct 



gatcctgtaa 
agggtatttt 
gcagtacata 
actttcttta 
actgtgtgaa 
attgaaagag 
ttctcattgc 
attatttatt 
ctgtgctttt 
cgctttcgat 
ggatagcaga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
673 



<210> 463 
<211> 630 
<212> DNA 

<213> Escherichia coli 



<400> 463 

tggtggcatt 

caccacgggg 

acaacacagg 

cggctccggt 

agtaacgtgt 

caggggggaa 

cgttcaggcg 

agaccggtgc 

tgaaaatctt 

gttgttcgat 

caacggggac 



ggttgctgga 
gctaatcttg 
cggctatatg 
cattgctggc 
catgcgggcg 
actctgcggc 
aacgtacggt 
tgtttgtggc 
gccatcgaga 
gacaatcggt 
gtcggtgata 



gagagaaaac 
actctagacc 
acgttcgcag 
attcttgcca 
tcaggctgcc 
ctttttcgtt 
ttatcctggt 
aacccactgg 
acgcgaacca 
gccccctgat 
gacgggctag 



ccccgcacgt 
actcaagaat 
agctgggcat 
gtatgatcgt 
gtaatggcaa 
cttactgcgg 
attgaataac 
tgagtttttt 
ccagatcgga 
gcggtgcctt 



tgcaggtatg 
agccgcgaaa 
ggccttctgg 
gaactggctg 
tttgcgcccg 
gtaaggcacc 
tactgcattt 
ccagtcaaca 
gatagccagg 
catgccgaag 



cacctgacaa 
cgttgtcatt 
catgatttag 
aacaagcgga 
gaccaggccg 
cagtcgccgc 
gagttct egg 
ttgtctt egg 
aagctgeteg 
aatttcaccc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
630 



<210> 4 64 
<211> 391 
<212> DNA 

<213> Escherichia coli 
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<400> 464 * 
ctcaggctgc 
gagagaatca 
aaaaaatttg 
gttgctgctt 
tggcccagcg 
caaccagcgc 
ttgtctccag 



tgattgtttt 
taaacgtggt 
agggataagg 
gagggtcgtt 
gaccagcata 
ccaggagcgc 
ccattcacga 



tttgtgcaat 
gaatgatgat 
ccggaatggc 
ttaaccagac 
ttaggatggc 
agacttagca 
cgactggcgg 



ggcgcggtat 
tgttagcaag 
tccggccaga 
gccaggcgct 
gaatcgtcca 
gcatattcca 
a 



tagcgtcgtt 
gaaaactgtc 
gggaagttaa 
ccatacgcca 
gatcgccatc 
gcgacgatcg 



gctgtcgatg 
aaaaat cttc 
ccgcgaagct 
aaaccgcgtc 
acgctactgc 
taagcgcctg 



<210> 465 
<211> 625 
<212> DNA 

<213> Escherichia coli 



<400> 465 

aacacaccac 

gtactgggaa 

gactaatgaa 

ggcctgattt 

ggatagattt 

ttgacacagt 

acattttgtt 

gcccggtccg 

cctgctgtca 

aactggcaaa 

atgataaatt 



accataaacg 
taattttatt 
cggagataat 
ttatgacggc 
gtgttttgct 
agcatcagtt 
cgtcaataaa 
ctcttccaat 
cggtaatgtt 
ttctgacacc 
cttcgccccc 



gaggcaaata 
ttctggtttt 
ccctcaccta 
gaaaaaaaac 
tttacgctaa 
ttctcaatga 
catgcagcga 
gaccacatcc 
gatatgccct 
tgcacgacat 
gtagc 



<210> 466 
<211> 623 
<212> DNA 

<213> Escherichia coli 



atgctgggta 
ctggccgcgt 
accggcccct 
cgccagtaaa 
caggcatttt 
atgttaaacg 
tttcttccgg 
agaggctctt 
tcagaatgtg 
gcttcttcat 



atatgaatgt 
atttcagcca 
tgttacagtt 
ccggcggtga 
cctgcactga 
gagcttaaac 
tttgcttacc 
caggaaatgc 
tgatggcatg 
cattagccgc 



tttaatggcc 
caaatgggat 
gtgtacaagg 
atgcttgcat 
taacgaatcg 
tcggttaatc 
ctcatacatt 
gcgactcaca 
gttatcgact 
tttgacaata 



60 
120 
180 
240 
300 
360 
391 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
625 



<400> 466 
tgcttttgaa 
aaaacgataa 
, tctttcggta 
agtggctcgt 
ctcaactgta 
ccaccgaagc 
atccataaag 
aaataacgac 
gatgtccgcc 
cacagttaaa 
aactggttgc 



tatgtgctcg 
tctgaaagag 
tcagccagag 
tagcaaaaag 
agggtattga 
ccggaggtgg 
agtcggtctt 
attgctgtgt 
ctttttaaag 
agtcatgtta 
agtcacctgg 



caatcttgag 
ccaaggtatt 
agtgagacca 
tttcaacaat 
aatgttaaca 
tgaaat aaaa 
gtctgttaaa 
gtagtcctgg 
tgaattttgt 
gtccttattg 
agg 



aaggaaatgg 
tcagtataag 
aaaatgataa 
ggagtaaata 
caagctctcg 
ccgggcacaa 
tttaaatggt 
cggcatcagt 
gatgcggtga 
gtttgggtgg 



cgaccacgaa 
cattgaatgc 
tcgtatacat 
catccaacat 
ctgtaggggt 
cacgaaggcg 
gggagtgcgc 
ttttttcttg 
atgcggctaa 
gaaagccgac 



agaaaaggca 
gacagtaaac 
aagtctttcg 
atcaataact 
atagccgaga 
catttccgat 
ctccggttgt 
aagttcggct 
gcgcacgtgg 
tgtaattgtt 



<210> 467 
<211> 234 
<212> DNA 

<213> Escherichia coli 
<400> 467 

tgtttactta caagagattc atctttgtat aaataaagat aagtaattac gcataaaaca 
acaatgatta taatagcaaa aataaatatt atcatctttg atagattact tgagatagcc 
agcatcttgt aaagccttta tcgttttttt atgctctgga ttaatataat cactacatct 
atctgagcaa tctgttgttg atggacat gt - caacccatgg tcatttacag ccaa 

<210> 468 
<211> 529 
<212> DNA 

<213> Escherichia coli 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
623 



60 
120 
180 
234 



-221- 



BNSDOCID: <WO 0044906A2_I_> 



WO 00/44906 




PCT/USOO/02200 



<400> 468 

attagctatt tcggctaaaa tagagactac atgtcttcgg tccatctcac ttaaggagtg 60 

tagttccgtt gtaagttttt ccatagcttg cactgctaaa tttcgaacaa ggaattttct 120 

gctggtaatc tctaaaaaga tggcatggtt tacaatgatt tttgtttcct tttgattatt 180 

atgaacaact gtccatgatt tcgtttaaga atgaagagaa atcactaaac gaactgaata 240 

tattttctgt gccaatatta tctctaattt caaaaaagtt acttttaatg tcggtaatga 300 

ctccaactta ttgatagtgt tttatgttca gataatgccc gatgactttg tcatgcagct 360 

ccaccgattt tgagaacgac agcgacttcc gtcccagccg tgccaggtgc tgcctcagat 420 

tcaggttatg ccgctcaatt cgctgcgtat atcgcttttc cttatcagtt cgttgatgtc 480 

agtggttttg accacgaggg agcttcacgc gagttattga aaaccctga 529 



<210> 469 
<211> 261 
<212> DNA 

<213> Escherichia coli 



<400> 469 

caaagaacct tcaacatgaa aaatatccat ttgtttgcaa aaaaagatta ttaggaagga 60 

aattaatgca attatcgaaa attcaaaaaa tatccaaaaa tagtatactt tattccagaa 120 

gagttcaata taatgtttgt cttcaatttt tcttacttca gggtaatata gattgctcat 180 

tacattgtga gcttcatctt tatttaattt tctgttgact ccagctc'tcc gtgataacgg 240 

ttttataatt agatgcttat c ■ . ~ ~ 



<210> 470 
<211> 98 
<212> DNA 

<213> Escherichia coli 



<400> 470 

agatgattgc cgggaacttg ttagcggcac gcaggcggcg gctcgcaccc ttaccctgct 60 
ctttacgtac ttctgcgttg atagtaaaca tttctttc 98 

<210> 471 
<211> 259 
<212> DNA 

<213> Escherichia coli 
<400> 471. 

agcgcgaacg aagtcgatgt gctgcagctt cggtttgtac gggtgacgct gtacgtcctg 60 
agctttaact ttgatrtctt taccgtcaac aacgatggtc agaacttcgc tgtagaattc 120 
agctttagct tgcatgttca tgactttgtc gtgatccagc tcgatagcca gcggcgcttc 180 
tttgccaccg tagatgattg ccgggaactt gttagcggca cgcaggcggc ggctcgcacc 240 
cttaccctgc tctttacgt 259 

<210> 472 
<211> 94 
<212> DNA 

<213> Escherichia coli 
<400> 472 

aaaaacggcg taaagaaagg atgcaaacat gttaataaaa actcaaattg atcccacgta 60 
tatattacgc cgcaaaatcc ttacaataaa cagg 94 

<210> 473 
<211> 174 
<212> DNA 

<213> Escherichia coli 
<400> 473 

ttaattatta aaatagtgta acgcgattat gtggttatgg gggtaaacat taaataaacc 60 
agcggggagg ggaggtaaag tgaaaaaata aaaagcggat aatcttaata agcaggccgg 120 



-222- 



BNSDOCID: <WO 0O44906A2_l_> 



WO 00/44906 



PCT/US00/02200 



acagcatcgc catccggcac tgatacgagg tttatttcag ctcatcaacc atcg 

<210> 474 
<211> 138 
<212> DNA 

<213> Escherichia coli 
<400> 474 

ctgtaaaaac gtcaaaaaga gtgttttatc aacagaagaa tggaggtctg acagatagta 
gtaatgcaaa aaaatggaga cttaagttga atgaacggga gtaaagcgaa aagactatag 
agtgaaggag aaattccc 

<210> 475 
<211> 191 
<212> DNA 

<213> Escherichia coli 
<400> 475 

tttgttggct taatattcta ttgttatctt tatttataga tgtttatatt gcatgaggtg 
gtttttggag agaagaatga ggaagatgcg tcgagccaca gaaacgttag ctttacatat 
agcggaggtg atgtgaaatt aatttacaat agaaataatt tacatatcaa acagttagat 
gctttttgtc g 

<210> 476 
<211> 245 
<212> DNA 

<213> Escherichia coli 
<400> 476 

cggccattta tacaggaaaa gcctatgtca gaacgtaaaa actcaaaatc acgccgtaat 
tatctcgtta aatgttcctg cccaaactgc acccaagagt cagaacacag tttttcaaga 
gtacaaaaag gtgccctttt gatctgccct cattgcaaca aagtattcca gacaaatctt 
aaagctgtag cctgattgat tttattagta acaagtattt ttt atatttt. aataatatat 
ttaaa 

<210> 477 
<211> 319 
<212> DNA 

<213> Escherichia coli 



174 



60 
120 
138 



60 
120 
180 
191 



60 
120 
180 
240 
245 



<400> 477 

aaattttcag gtaccttgtc accatacttt tttttctgag cattaatgat 

tcttgaggat ctttaactcc ccacatttgg tggaaagtat tcatattaaa 

aataatttgt ctttataaat cgccagtgga gaattagtaa aacgattaaa 

tcattaacgt aatcccatat atatttatca ttggtatgaa aaatatgtgc 

tgaatctgga taccctcaca gtcctctgtg tacgcatttc caccgatatg 
tcaatcacta aaacttttt 

<210> .478 
<211> 149 
<212> DNA 

<213> Escherichia coli 



attttgagct 
aggaaggttg 
ttctactaaa 
accatattta 
atttcttttc 



60 
120 
180 
240 
300 
319 



<400> 478 

gcagtgatcg aagcgatgac gaagtgtatg gaaaaatcag aaaaactcag caaatcctga 
tgactttcgc cggacgtcag gccgccactt cggtgcggtt acgtccggct ttctttgctt 
tgtaaagcgc caaatctgcc gatttcaac 

<210> 479 
<211> 330 
<212> DNA 



60 
120 
149 
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<400> 479 

gaaagtatct 

gaagcgcgta 

aactacagat 

tatttagatt 

acccacgttc 

ataggcgctg 



tcgttattga catcactgga aaatataact 

ccgtatctgg acaaacattt atcgagctta 

aacatttgcg cgtcctttgc agtaatgccc 

aaattaccag tattt cttcg gagtgaagaa 

gcggaccagt cttgatctac gtcaccacca 

aagttcttgg tgaagctaaa 



tgcttttcat tattaaactc 
ccaaattcct gaagaggttt 
gtcaaatcct tgacgggcat 
tattaccagg tatatttaac 
ccgaggtagt tagcatcggt 



<210> 480 
<211> 191 
<212> DNA 

<213> Escherichia coli 
<400> 480 

tttttttcca gcaacggagc aaaaggtttg cccttgtgca gcticagggtt aaccacttta 
actacgtggc gacgacccgg agatgtcggt ttacatttaa caactgccat tgtattactc 
ctccgactta ctcagcgccg ccaacgaagt ccagattctg gccttctttc agggtgacgt 
aagctttttt c 

<210> 481 
<211> 188 
<212> DNA 

<213> Escherichia coli 
<400> 481 

tccctttaac taccagggtg ttaacgactt cgacttcgac ttcaaacagt ttctgcacag 
cagctttgat ttctgctttg gtcgcgtctt tagcaacttt gagtacgatg gtgttggatt 
tttccatcgc agtagacgct ttttcagaaa cgtgcggtgc acgcagcacc ttcagcagac 
gttcttca 

<210> 482 
<211> 172 
<212> DNA 

<213> Escherichia coli 
<4 00> 4 82 

caaaggcgaa caaagcctgt gaagcccgaa ggctccacag acagtgctac ttgaaggcct 
tactgtttct tcttaggagc gagcaccatg atcatctggc ggccttcgat cttcgttggg 
aaggattcga ccactgccag ttcttgcaaa tcgtctttca cgcgattaag ca 

<210> 483 

<211> 266 . ' 

<212> DNA 

<213> Escherichia coli 
<400> 483 

tggagaaaac gggtgattga taaagcaatc atcgttctag gggcgttaat tgcgctgctg 
gaactgatcc gctttctgct tcagcttctg aactgatagc ggaaacgtaa ttaagggcta 
agagcacact actcttagcc ctttaacatt taacgcattg tcacgaactc ttctgccgcc 
gttgggtgaa tggcgacggt attgtcgaag tcttttttgg ttgcccccat cttcagcgcc 
accgcgaagc cctgcaacat ttcgtc 

<210> 484 
<211> 259 
<212> DNA 

<213> Escherichia coli 



60 
120 
180 
240 
300 
330 



60 
120 
180 
191 



60 
120 
180 
188 



60 
120 
172 



60 
120 
180 
240 
266 



<400> 484 

cgcaggcagc tgatggtcaa caggatgaga gaaacccaga gacaggttaa tcacattgcc 



60 
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tttaaccgct gcacggtaac ctacaccaac cagctgcagc ttcttagtga agccttcggt 

aacaccgata accattgagt tcagcagggc acgcgcggta ccagcctgtg cccaaccgtc 

tgcgtaacca tcacgcggac cgaaggtcag ggtattatct gcatgtttaa cttcaacagc 
atcgttgaga gtacgagtc 



120 
180 
240 
259 



<210> 485 
<211> 73 
<212> DNA 

<213> Escherichia coli 
<400> 485 

caggtcggaa cttacccgac aaggaatttc gctaccttag gaccgttata gttacggccg 60 



ccgtttaccg ggg 



73 
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Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 

This International Search Report has not been established in respect of certain claims under Article 1 7(2)(a) for the following reasons: 
1. I j Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 



2] Claims nos.: H 13 34-45 47 48 50 51 53 55 57-63 65 67-93 95-105 107-110 

because they relate to parts of the international Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 

see FURTHER INFORMATION sheet PCT/ISA/210 



3. | | Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 

Box li Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 

see additional sheet 



1 . [ | As all required additional search fees were timely paid by the applicant, this International Search Report covers all 



searchable claims. 



2. [" ] As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 



3. I w I As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
t- 4 ^ covers only those claims for which fees were paid, specifically claims Nos.: 

1-10, 12, 14-33, 46, 49, 52, 54, 56, 64, 66, 94 and 106, all partially 



4. [ | No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest \ 1 The additional search fees were accompanied by the applicant's protest. 




No protest accompanied the payment of additional search fees. 
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FURTHER INFORMATION CONTINUED FROM PCT/ISA/ £10 



Continuation of Box 1.2 

Claims Nos.: 11 13 34-45 47 48 50 51 53 55 57-63 65 67-93 95-105 107-110 



In view of the large number and also the wording of the claims presently 
on file, which render it difficult, if not impossible, to determine the 
matter for which protection is sought, the present application fails to 
comply with the clarity and conciseness requirements of Article 6 PCT 
(see also Rule 6.1(a) PCT) to such an extent that a meaningful search is 
impossible. Consequently, the search has been carried out for those parts 
of the application which do appear to be clear (and concise), namely the 
nucleic acid sequences as identified in claims 1 and 8 respectively, 
sequences related to said sequences as well as their use. This 
corresponds to the subject-matter of claims 
1-10,12,14-33,46,49,52,54,56,64,66,94 and 106. 

It should be noted that since claim 46 has been searched, the 
subject-matter of claims 35-45 has been searched restricted to the gene 
products of claim 46, i.e. for those gene products for which (additional) 
search fees have been paid 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1-7, 12, 49, 52, 56, 66, all partially 

Invention 1: 

A purified or isolated nucleic acid sequence consisting of 
SEQ.ID. No.: 405, a vector comprising said sequence, a host 
comprising said vector, the use of said sequence for 
inhibiting cellular proliferation, a composition comprising 
said sequence, the use of said sequence for inhibiting the 
expression of a gene and the use of said nucleic acid 
sequence for identifying bacterial strains. 



2. Claims: 1-7,' 12, 49, 52, 56, 66, all partially 
Inventions 2 to 81: 

Idem as invention 1, but for SEQ.ID. NO's 406-485 respectively 



3. Claims: Claims 8-10,12,14-33,46,54,64,66,94 and 106, 
all partially: 



Invention 82: 

A purified or isolated nucleic acid consisting of 
SEQ.ID. No.: 82, a vector comprising said nucleic acid 
sequence, a host comprising said vector, a polypeptide 
encoded by said nucleic acid sequence and having the 
sequence of SEQ.ID. No.: 243, an antibody binding said 
polypeptide, a method for producing said polypeptide, a 
method for identifying compounds influencing the activity of 
said polypeptide, a method for identifying compounds 
influencing the level of said polypeptide, a method for 
inhibiting the expresion of said nucleic acid, the use of 
said nucleic acid sequence for identifying bacterial strains 
and the use of said nucleic acid sequence for identifying 
proliferation inhibitors. 



4. Claims: Claims 8-10,12,14-33,46,54,64,66,94 and 106, 
all partially: 



Inventions 83 to 242: 

Idem as invention 82, but for SEQ.ID. No's 83-88, 90-242 (and 
their corresponding polypeptide sequences, see Table II) 
respectively. 
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